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Preface 



The symposia on Fundamentals of Computation Theory are held every two years. 
Following the tradition established at the first FCT 1977, the conference brings 
together specialists in theoretical fields of Computer Science from various coun- 
tries and stimulates mathematical research in theoretical computer science. To- 
pics of interest for the satellite workshop on Efficient Algorithms WEA 2001 are: 
computational complexity, graph and network algorithms, flow and routing al- 
gorithms, coloring and partitioning, cuts and connectivity, packing and covering, 
scheduling algorithms, approximation algorithms, inapproximability results, on- 
line problems, randomized algorithms, integer programming, semidefinite pro- 
gramming, algorithmic geometry, polyhedral combinatorics, branch and bound 
algorithms, cutting plane algorithms, and various applications. 

The 13th FCT was held in Riga-Lielupe, August 22-24, 2001 with an ad- 
ditional day (August 25) for the satellite workshop WEA 2001. The previous 
meetings were held in the following cities: 

— Poznan-Kornik, Poland, 1977 

— Wendish-Rietz, Germany, 1979 

— Szeged, Hungary, 1981 

— Borgholm, Sweden, 1983 

— Cottbus, Germany, 1985 

— Kazan, Russia, 1987 

— Szeged, Hungary, 1989 

— Gosen-Berlin, Germany, 1991 

— Szeged, Hungary, 1993 

— Dresden, Germany, 1995 

— Krakow, Poland, 1997 

— Iasi, Romania, 1999 

This year the number of submitted papers was high. The Program Committee 
decided to accept 28 submissions as regular papers and 15 submissions as short 
papers. Additionally, the Program Committee of WEA 2001 accepted 8 papers. 
This volume contains all the contributed papers. The invited speakers were: 

— Janis Barzdins (Riga, Latvia) 

— Gilles Brassard (Montreal, Canada) 

~ Klaus Jansen (Kiel, Germany) 

— Juhani Karhumaki (Turku, Finland) 

~ Marek Karpinski (Bonn, Germany) 

— Boris Plotkin (Jerusalem, Israel) 

~ Umesh Vazirani (Berkeley, USA) 
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The invited speakers for WEA2001 were: 

— Foto Afrati (Athens, Greece) 

— Andreas Brandstadt (Rostock, Germany) 

— Luisa Gargano (Salerno, Italy) 

~ Stephane Perennes (Sophia-Antipolis, France) 

All papers published in the workshop proceedings were selected by the pro- 
gram committee on the basis of referee reports. Fach paper was reviewed by at 
least three referees who judged the papers for originality, quality, and consistency 
with the topics of the conference. The Program Gommittee consisted of: 

— Andris Ambainis (Berkeley, USA) 

— Setsuo Arikawa (Fukuoka, Japan) 

— Yuri Breitbart (Bell Labs, USA) 

— Gristian Galude (Auckland, New Zealand) 

— Bogdan Ghlebus (Warsaw, Poland) 

— Alexander Dikovsky (Nantes, France) 

— Rusins Freivalds (Riga, Latvia), Ghair 

— Jozef Gruska (Brno, Gzech Republic) 
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— Aleksander Letichevskij (Kiev, Ukraine) 

— Andrzej Lingas (Lund, Sweden) 

— Ghristoph Meinel (Trier, Germany) 

— Valery Nepomniaschy (Novosibirsk, Russia) 

— Jaan Penjam (Tallinn, Estonia) 

— Anatol Slissenko (Paris, France) 

— Garl Smith (Maryland, USA) 

— Esko Ukkonen (Helsinki, Finland) 

— Eli Upfal (Brown, USA) 

— Mars Valiev (Moscow, Russia) 

The Program Gommittee of the Workshop on Efficient Algorithms WEA 
2001 worked in close touch with the Program Gommittee of FCT 2001 but 
made their decisions separately. This Program Gommittee consisted of: 

— Yossi Azar (Tel-Aviv, Israel) 

— Evripidis Bampis (Evry, France), Go-chair 

— Michael Bender (Stony Brook, USA) 

— Thomas Erlebach (Zurich, Switzerland) 

~ Wenceslas Fernandez De La Vega (Orsay, France) 

— Wen-Lian Hsu (Academia Sinica, Taiwan) 

— Klaus Jansen (Kiel, Germany), Go-chair 

— Elias Koutsoupias (Athens, Greece and Los Angeles, USA) 

— Jan Kratochvil (Prague, Gzech Republic) 

— Seffi Naor (Haifa, Israel) 

— Takao Nishizeki (Tohoku, Japan) 
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Towards Axiomatic Basis 
of Inductive Inference* * 



Janis Barzdins^, Rusins Freivalds^, and Carl H. Smith^ 



^ Institute of Mathematics and Computer Science, 
University of Latvia, Raina bulv. 29, Riga, Latvia* * * 

^ Department of Computer Science, University of Maryland, U.S.Ad 



Abstract. The language for the formulation of the interesting state- 
ments is, of course, most important. We use first order predicate logic. 
Our main achievement in this paper is an axiom system which we believe 
to be more powerful than any other natural general purpose discovery 
axiom system. 

We prove soundness of this axiom system in this paper. Additionally 
we prove that if we remove some of the requirements used in our axiom 
system, the system becomes not sound. We characterize the complexity 
of the quantifier prefix which guaranties provability of a true formula via 
our system. We prove also that if a true formula contains only monadic 
predicates, our axiom system is capable to prove this formula in the 
considered model. 



1 Introduction 

The term ’’inductive inference” is used for synthesis of general regularities from 
particular examples. Different formalizations of the inductive inference problems 
are possible. Since the seminal paper by E.M. Gold the computational 

learning theory community has been under hypnosis presuming that the main 
problem in the learning theory is to restore a grammar from samples of language 
or a program from its sample computations. However scientists in physics and 
biology have become accustomed to looking for interesting assertions rather than 
for a universal theory explaining everything. The question arises what language 
is to be used to formulate these assertions. 

One of the most universal languages to formulate such assertions is the first- 
order predicate logics. Hence along with Gold-style inductive inference it is nat- 
ural to investigate inference of assertions in predicate logics (’’formulas”) from 
local observations of the given model (’’elementary facts”). Research in this di- 
rection was started already in | |OSW91| and iBEsna. 

* This project was supported by an International Agreement under NSF Grant 
9421640. 

* * * Research supported by Grant No. 01. 0354 from the Latvian Gouncil of Science, Con- 
tract IST-1999-11234 (QAIP) from the European Commission, and the Swedish In- 
stitute, Project ML-2000 
t Supported in part by NSF Grant CCR-9732692 
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We can look on the inductive inference from another viewpoint as well, 
namely, how the inductive inference process is defined. Traditionally, ’’identi- 
fication in the limit” is considered IkloldO . We say that the inductive inference 
machine (IIM) identifies the given object (for instance, a function) in the limit, 
if reading from the input more and more input-output examples IIM produces 
a correct hypothesis (for instance, a correct Godel number of the target func- 
tion) and never more changes it (in spite of working on for infinitely long time) . 
Modifications of the ’’identification in the limit” are considered. For instance 
at the finite (or, ” one-shot”) identification the IIM processes the infinitely long 
sequence of the values of the target function, and at some finite moment the IIM 
produces only one correct output. Of course, the capability of finite identification 
is much more limited. 

In the paper a new type of inductive inference ’’learning with confi- 

dence” ( or ’’learning with belief levels”) is considered. In some sense this notion 
is closer to the human inference process. When we produce any hypothesis based 
on a finite number of observations we are tended estimate some belief level of 
our hypothesis. This way, we come to a natural inductive inference type ’’learn- 
ing first-order predicate logic formulas with belief levels from elementary facts”. 
By ’’elementary facts” we understand values of the given predicates for specific 
values of the arguments. This inductive inference type is central in our paper. 

We try to axiomatize the possibilities of this inference type. We believe that 
we have succeeded only thanks to the naturality of this notion. We have no idea 
how one could axiomatize the possibilities of identifiability in the limit. One of 
reasons of this difficulty is this identification type’s remoteness from the natural 
human inference. 

Why we believe in great importance of the axiomatizability problem? The aim 
of axiomatization is to find the basic elements of which our practical everyday 
reasoning is constructed. 

Remember the axioms of Euclidean geometry, axioms of first-order pred- 
icate logics (in these cases a complete axiomatization was achieved), axioms 
of Peano arithmetics, Zermelo-Fraenkel axiom system for the set theory. The 
latter systems are incomplete, and they cannot be made complete (by Godel in- 
completeness theorem). However this does not challenge the importance of this 
axiomatization. For instance, when we speak of Peano axiom system |Peano| . 
then all natural assertions in arithmetics can be proved in Peano arithmetics 
(however not always in a trivial way) but all the counterexamples provided by 
the proof of the Godel incompleteness theorem are highly artificial. 

David Hilbert’s system of axioms of the elementary geometry |Hilbl879j has 
a similar status. Being a more precise version of Euclidean axioms, this system 
is not a complete system (by a corollary from Godel incompleteness theorem). 
Nonetheless Hilbert’s system is considered to be the ’’standard”. 

A similar status is hold also by Zermelo-Fraenkel axiom system for the set 
theory izEm. 

We return to the inductive inference. Until recently, it was investigated only 
from a pragmatic viewpoint without a deeper search of the mathematical foun- 
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dations. At first it seems that inductive inference and axiomatization are incom- 
patible notions because axiomatization traditionally is considered as a preroga- 
tive of deductive systems. However inductive inference processes (learning from 
examples) performed by distinct persons show great similarity. This prompts 
existence of objective regularities in the inductive inference. On the other hand, 
axiomatization of inductive inference presumes search for objective regularities. 
It is evident that axioms of inductive inference should differ from axioms of de- 
ductive systems, because the latter serve for deducing absolute truth while the 
former are only hypotheses with higher or less high belief level. 



2 Learning Formulas with Belief Levels 

First of all we go over the necessary logical concepts that will be used. A model 
will be a triple {E, N, I) where 17 is a finite set of predicates, called a sig- 
nature, with designated arities, N is the domain of the variables used in the 
predicates and I is the interpretation of the predicates. Unless otherwise noted, 
the domain will be the natural numbers, IM. For example consider the model 
Mq = (Aq, IN, /q) where Aq contains three binary predicates, P\, P 2 and P3. The 
interpretation /g is given by three formulas: P\{x,y) '■ x <y, p 2 {x,y) : y = 5x 
and P^{x, y) : y = x'^ . The elementary faets of a model are all of the instantiated 
predicate symbols of the model with associated truth values. The elementary 
facts of our example model Mq include Pi (2, 5) = T, Pi (6, 2) = P, Pi (4, 5) = T, 
P2(2, 10) = T, P3(3, 10) = P. In some of the proofs that follow it will be more 
convenient to list these elementary facts as Pi(2, 5), -'Pi(6, 2), Pi (4, 5), P2(2, 10), 
-'P3(3, 10). A term is either a notation for a member of the domain or a vari- 
able. The elementary formulae of a model M = {E, N, I) include are formulae 
of the form P(ni, • • • , n^) for P a k-axy predicate in E and each ni (1 < i < k) 
is a term. Next, formula of the first-order predicate logics, validity, and any 
other logical notions are defined in the traditional way (see any textbook, e.g. 
Ilkleenetel . 

Let M = < A, , IN, / > be a model. By Em we denote the set of all elementary 
facts of the model M. By Dm we denote the set of all enumerations of elements 
of Em- 

Let some standard axiom system A for the first order predicate calculus, 
(e.g. that from |K1eene5^ be fixed. If it is possible to derive a formula / from 
the elementary facts of some model M via the axiom system, we write M\ — /. 
Notice that if Pm is, for example, {-'P(O), P(l), • • •} then M\ — 3a;(P(a:)). This 
follows from the “P(l) ^ 3x(P(a;))” being in A. 

Now we recall some notions and results from [BFSflfij and |BFS97j to be used 
below. 

The main notion from |BFS96j that we need is that of ” belief level” . A belief 
level is a rational number between 0 and 1, inclusive. A 0 represent a vote of no 
confidence and 1 indicates absolute certainty. 
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Definition 1. A belief formula learning (abbreviated BFL) maehine is an al- 
gorithmie device that takes as input an enumeration of the elementary facts of 
some model and produces as output first order formulae paired with belief levels. 

A BFL produces a sequence of outputs, like an FFL, but unlike the FFL 
case, the same formula may appear over and over again, with different belief 
levels. If L is BFL and D is its input data and / is a first order formula, then 
B{L, D, f) = (6i, 62, ■ ■ •) is the sequence of belief levels associated with / that 
L produces when given D as input, in the order of appearance in the output 
stream. 

Definition 2. We say that a BFL L e-learns formula f from the input data D 
iff B{L, D, f) monotonically converges to 1. The set of formulas produced this 
way is denoted by L{D). 



Definition 3. A BFL L is correct iff for all models M and all D G Dm, all the 
formulas in L{D) are true for the model M . 



Definition 4. A correct BFL L is called absolutely best if L'{D) C L{D) holds 
for arbitrary correct BFL L' , for arbitrary model M and arbitrary enumeration 
of elementary facts D G Dm ■ 



Theorem 1. (Theorem 24 from IBFLD'^) There is no absolutely best BFL. 



Definition 5. A formula f is e-learnable iff there is a correct BFL L which 
e-learns f from every D G Dm of every model M such that f is true in M . 



Theorem 2. (Theorem 20 from \BFS91^ ) Let A be an arbitrary Boolean formula 
with predicate symbols Pi, • • • , P„, and Xi, - ■ ■ ,Xu,yi, - ■ ■ ,Vv be variables of the 
involved predicates. Then the formula 

Vxi • • • \/xu3yi ■ ■ ■ 3yvA{xi,- ■■ ,yv) 



is e-learnable. 



Theorem 3. (Theorem 23 from IBFSO'l^ ) Lf f is a first-order predicate logic 
formula involving only monadic predicates, then f is e-learnable. 
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3 Starting Axiomatization: The First Two Axioms for 
Formula Learning 

We start with introducing a new notion 

Aje 

where A is a formula and e is a rational number with 0 < e < 1. We call the 
number e the belief level of the formula A. The intuitive meaning of A/e is that, 
based on the prior steps of the proof, we have belief level e that the formula A is 
true. Our new axioms will exploit the notion A/e. To formulate the new axioms 
we need one more notion, namely that of weighting function. 

Definition 6. A weighting function is a recursive function ui such that 

OO 

^w(n) = 1. 

n— 0 

For example, let uj{n) = 1/2”+^. The idea of the weighting function is that 
some examples are more relevant than the others. The weighting function will 
be used to determine the belief level of a universally quantified sentence, based 
on known examples. 

In this section along with the symbol | — which denotes the ’’proves” relation 
in traditional logic, we use the symbol ||= to denote the same relation in extended 
axiom system that we now present. For example, the first new axiom below says 
that if something is provable in the standard logic, it is provable with belief level 
1 in our new system. 

Axiom 1 If M\ — A, then M\\=A/1. 

Axiom 2 If A contains a free variable x , and M\\=A{ni) / e\, . . . ,M\\=A{nk) / Cu, 
where G IN and A^nf) is obtained from A by substituting Ui for all occurences 
o/ x, f = I, 2, • • • , /c, then M\\='ixA{x) / e, where e = co{ni).ei + . . . + w(nfe).efc. 

Definition 7. We say that a formula A is e-provable in the model M via the 
axiom system 12 iff for arbitrary enumeration of elementary facts D G Dm and 
for arbitrary e > 0 there is an e > 1 — e such that M||=A/e 

It is easy to see that for axiom systems 12 containing only Axiom 1, Axiom 2 
and the axioms Axiom 3’ , Axiom 3” or Axiom 3 described below the following 
theorem holds. 

Theorem 4. The class of e-provable formulas does not depend on the particular 
weighting function. 

The axioms Axiom 1 and Axiom 2 seem to be natural enough. However this 
system of two axioms is rather weak because the two incorporated mechanisms 
(the possibilities of the standard logic and the belief levels) are rather disjoint. 
After usage of Axiom 2 there is no way to apply Axiom 1. We need a mechanism 
to combine Axioms 1 and 2. 
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4 A Naive Third Axiom 

Axiom 3’ If: 

a) ^ B), 

b) M|M/e, 
then M\\=B/e. 

Axiom 3’ may be considered to be a good companion for Axioms 1 and 2. 
However we have 

Theorem 5. The system of axioms { Axiom 1, Axiom 2, Axiom 3’ } is not 
sound. 

Proof. We prove that there is a model M and a formula / such that / is 
false in the model M but nonetheless / is e-provable in M via this system of 
axioms. 

Indeed, we consider a model M with one unary predicate P{x) such that 
P{ 1 ) is true and P{ 2 ) is false. We consider the formula \/xP{x) which is false in 
M. Let uj{n) be the weighting function. Axiom 2 provides us VccP(x)/a;(l). 

We construct a formula Qk being 

VxiVx2 • • • yxk{P{xi) V P{X2) V • • • P{xk)) 

A somewhat similar formula being 

VxiVx2 • • ■'ixkiPixi) V P{x2) V • • • P{xk)) => yxP{x) 

is a tautology in the standard predicate logic. Hence, by Axiom 1, it can be 
e-proved with belief level 1. Using Axiom 2 k times, we obtain 

Applying Axiom 3’, we get 

Va;P(x)/l- (l-o;(l))'=. 

If fc — >■ 00 , then the belief level converges to 1. Hence the false formula \/xP{x) 
is e-provable with belief level converging to 1 . K1 



5 A Less Naive Third Axiom 

Why Axiom 3’ caused non-soundness of the axiom system? Where was the 
essence of the proof of Theorem El ? We see that it became possible because 
we allowed too many quantifiers in the formula Qk- 

Definition 8. Let A and B be first order predicate formulas in prenex form. 
We say that formula A is of no higher quantifier complexity than B ( denoted 
by A <quant B ) if the number of quantifiers (no matter, how many quantifier 
alternations are present) in A is less or equal to the number of quantifiers in B. 
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For instance, consider formulas 

Ai : y xyy3z3u{R{l7 , x, z) A 9) V S{y)) 

A 2 : 3x3w3t<;3p((G(5, w) A H{v, 17, x)) ~^K{x,p)) 

A^-.ygH{g,g,n) 

It is easy to see that A\ Aquant 7 I 2 , A 2 Aquant Aquant Aij but 

Al~' 'ZZquant A^. 

Please notice that our definition of diquant ignores the names of the bounded 
variables. 

Axiom 3” If: 

a) M\ — (A B), 

b) A ^quant 7?, 

c) M|^A/e, 
then M|^i3/e. 

However we have 

Theorem 6. The system of axioms { Axiom 1, Axiom 2, Axiom 3” } is not 
sound. 

Proof. We consider a model M where the binary predicate P(x,y) means 
< x”. In this model the formula 3xVj/P(x, y) is false. We consider the formula 
yyP{x, y) with one free variable x. Let k be an arbitrary natural number. Using 
Axiom 2, we can get the belief level 

Cfc = w(l) + w(2) + • • • + Loiki) 

for the formula yyP(k,y). On the other hand, in the classical logic the formula 

'^yP{k,y) 3xyyP{x,y) 

is a tautology. Using Axiom 3” with A = yyP{k, y) and B = 3xVyP(x, y) we get 
the belief level = w(l) + w(2) + • • • + w(fc) for the false formula 3xVyP(x, y) 
not containing k. Since k ^ oo implies Ck — >■ 1, we get belief level converging to 
1 for a false formula 3x\/yP{x,y). K1 



6 Final Third Axiom 

Definition 9. We say that A is of no higher constant complexity than B (de- 
noted by A <const B) if there is no constant which is used in A but not used in 

B. 



For instance, consider formulas 



A\ : VxVy323u(i?(17, x, z) A {P{u, 5) V S{y)) 
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A 2 : BxBvBw((G(5, w) A H{v, 17, cc)) -•K{x, x)) 
A3:ygH{g,g,17) 

It is easy to see that Ai 'Alconst ^ 2 ? A 2 Aconst Aconst Aij 

but A const A^. 

Axiom 3 If: 

a) M\—{A ^ B), 

A 'Alquant 
A 'Alconst 73, 

d) Af ||=A/ e, 
then M\\=B/e. 

7 Soundness of the Axiom System 

We will prove in this section that the axiom system {Axiorui, Axiom 2 , Axiom-i} 
is sound. Let us consider only formulas in a prenex form. The quantifierless body 
of the formula is a Boolean propositional formula made of terms being predicate 
symbols from the finite signature S with constants and variable symbols. 
Consider, for instance, two formulas 

A = VxBg(P(x, 7, g) Q{y, 5), 

B = Mx3y{-oQ{y, 5) V P{x, 7, y). 

They have the same quantifier prefix (with equal names for the bounded 
variables) and the quantifierless bodies are equivalent Boolean formulas of the 
same variables P{x, 7, y) and Q{y, 5). It is easy to conjecture that A is e-provable 
in a model M if and only if B is e-provable in M . To prove such a conjecture we 
consider the notion of reversible equivalence. We use A ^ i? to denote A :<quant 
B A A <const B. 

Definition 10. We say that formulas A and B are reversible equivalent (A 
evB ) in the model M , if M\ — (A B), M\ — {B ^ A), A< B and B < A. 

Definition 11. We say that formulas A and B are equivalent up to names 
(A =rename B ) if they differ only in the names of the hounded variables. 

Definition 12. We say that formulas A and B are generalized reversible equiv- 
alent (A =grev B) in the model M , if there is a formula C such that A =rev C , 
and C B . 

Lemma 1. If M\\=A/e, and A =grev B, then M\\=B/e. 

Proof. A =grev B implies A =rev C, C =rename B, M\ — {A C), and A :< 

B. Hence M\\=C/e by Axiom 3. Finally, the e-proof can be obtained 

from the e-proof M\\=C/e by renaming the variables introduced by usage of 
Axiom 2. K] 
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Lemma 2. For arbitrary formula B, there is only a finite number (up to gen- 
eralized reversible equivalence) of formulas A such that A < B. 

Proof. For the simplicity sake we consider only formulas in prenex form. Let 
S be the set of the constants used in B. There is only a finite number of different 
(up to reversible equivalence) Boolean formulas made of terms consisting of 
predicate symbols from the finite set E, the bounded variables used in B and 
the constants in S. Hence the number of formulas in prenex form is also only 
finite. The proof in the general case uses the same idea but technically it is more 
complicate. K1 



Theorem 7. For arbitrary formula B which is false in the model M , and for 
arbitrary weighting function oj, there is a real number eg < 1 such that e-proofs 
M\\=B/e cannot exist for e > eg. 

Proof. By induction over the number k of quantifiers used in B. 

Basis, k = 0. Obvious. 

k k+1 . Assume that Theorem is true for k or less quantifiers used. 
Assume from the contrary that there is a formula B with k -\- 1 quantifiers such 
that it is false in the model M and M\\=B/e with arbitrarily high e > 1. Consider 
all possible A such that A < B. 

-Some of such formulas A are true. Then A^ B cannot be proved classically 
in the model M. Hence Axiom 3 is not applicable in this case. 

-Some of such formulas A have k or less quantifiers, and A is false. Then, by 
induction assumption. A/e cannot be proved for high enough values of e. Hence 
such an A cannot be used in Axiom 3 to obtain B je with sufficiently high e. 

-Some of such formulas A have the same number fc -I- 1 of quantifiers, and A 
is false. By Lemma Qthere is only a finite number of distinct (up to generalized 
reversible equivalence) such formulas A. Denote them by Ai, A 2 , . . . , A„. How 
M\^Bje can be proved for arbitrarily high e < 1 ? Surely, not by Axiom 1. If 
the outermost quantifier of B is not V, the proof cannot be by Axiom 2. If the 
outermost quantifier of H is V, H is a formula 'ixF{x) and it is false, consider a 
particular value x = a such that F{a) is false. F{a) is a formula with k quantifiers 
and , by induction assumption, it cannot be proved for arbitrarily high values of 
e. Hence the only hope to prove M\\=B/e with arbitrarily high value of e is by 
Axiom 3 using M\ — A^ => B and M\\=Ai/ei with arbitrarily high values of e^. 
Hence for one of Ai, A 2 , . . . , A„ it is needed to prove Aj/e for values of e higher 
than those described in Axiom 2. However we have no means to do this. K1 



8 Usefulness 

Definition 13. We say that formula A is e-provable by axiom system fl iff A 
is e-provable by the axiom system Cl in every model M such that A is true in 
M. 
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This section (and the next one below as well) shows that our axiom system 
{Axiomi, Axiom, 2 , Axiom^} is powerful enough to e-prove the same classes of 
formulas which are e-learnable by Theorems 0 and 0 

Theorem 8. Let A he an arbitrary Boolean formula with predicate symbols 
Pi, - ■ ■ ,Pn, and xi, ■ ■ ■ , Xu,yi, ■ ■ ■ ,Pv be variables of the involved predicates. Then 
the formula 

Vxi • • • \/xu3yi ■ ■ ■ 3yvA{xi,- ■■ ,yv) 
is e-provable by the axiom system {Axiorui, Axiom 2 , Axiom^}. 

Proof. We prove here only a special case of this theorem when the formula is 
\/x3yP{x, y) and there is a sequence {ai, 02 , 03 , • • •} of natural numbers such that 
P{1, a\) is true, P(2, 02 ) is true, etc. The general case is similar and differs only 
in more heavy technical notation. The following sequence of proofs is performed 
via our axiom system. 



P(l,ai) classically 

3yP{l,y) classically 

3yP{l,y) Axiom 1 

'ix3yP{x,y) Axiom 2 
P( 2 , 02 ) classically 

3yP{2,y) classically 

3yP(2, ?/)Axiom 1 

'ix3yP{x,y)/uj{l).l + ui{2).l Axiom 2 
P{3,a3) classically 

3yP{3,y) classically 

3yP{3,y) Axiom 1 



\lx3yP{x,y)/ijj(l) + u}{2) + u)(3) Axiom 2 



9 Monadic Predicates 

Theorem 9. If f is a first- order predicate logic formula involving only monadic 
predicates, then f is e-provable by the axiom system 
{Axiomi, Axiom 2 , Axiom^}. 

Proof. For the simplicity sake we consider only formulas in prenex form. We 
prove our Theorem by induction. For quantifier less formulas Theorem is obvious. 
Assume that Theorem is true for k quantifiers and the target formula / contains 
fc + 1 quantifiers. 

Let Qx be the outermost quantifier of the formula /. Suppose that Pi, ... , Pm 
is a complete list of all monadic predicates in / which contain the variable x. To 
simplify the proof, assume that m = 2. The generalization to larger values of m is 
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obvious. We define 4 formulae fi, f 2, f 3, /t (in general 2 ™ formulae) derived from 
/ substituting, respectively, Pi{x) = T,P2{x) = T for /i, P\{x) = T,P2{x) = F 
for /2, Pi{x) = F,P2 {x) = T for f^, Pi{x) = F,P2{x) = F for f^. It is impor- 
tant that /i,/2j/3)/4 have at most k quantifiers. The target formula is equiv- 
alent to Qx{{Pi{x)&zP2{x)&zfi) V (Pi(a:)&-'P2(a;)&/2) V (-'Pi(a;)&P2(a;)&/3) V 
(-■Pi (a;)&-iP 2 (x)&/ 4 ) ) . 

We consider the cases Q = 3 and Q = V separately. 

- Assume that / is 3xF{x) and it is true for x = 7. Then for x = 7 only one 
part out of four 



(Pi(x)&P 2 (cr)&/i)V 

V(Pi(x)&-'P2(a;)&/2)V 

V(-'Pi(a:)&P2(a;)&/3)V 

V(-Pi(x)&-P2(x)&/4) 

can be true. For instance, if Pi(7)&-'P2(7), then for a: = 7 the second part is 
true. This implies that /2 is true in the model M for all the values of x (because 
/2 does not contain x). By induction assumption, Ml^/ 2 /e for arbitrarily high 
e < 1. Consider Axiom 3 with A = f 2 and B = \/xF{x). We concluded that 
the requirement a) of the Axiom 3 is satisfied. Notice that /2 ^ \/xF{x) since 
/2 contains less quantifiers and not a single constant not in \/xF{x). Hence, the 
requirement b) also is satisfied. 

Since (/2 3xF(x)) is true in the model M, and our classical proofs of the 

validity of formulas in models use all the power of Kleene axiom system, it is 
possible to prove M \ — (/2 3xF{x)). Hence, by Axiom 3, M\\=3xF{x)/e for 

arbitrarily high e < 1. 

-Assume that / is VP(x). For every x separately (e.g., for x = 7) 

P(7) = [(Pi(7)&P2(7)&/i)V 
V(Pi(7)&-P2(7)&/2)V 
V(-Pi(7)&P2(7)&/3)V 
V(-Pi(7)&-P2(7)&/4)] 

In the model M three parts are false but P(7) is true. If, for instance, the 
part 



(Pi(7)&-P2(7)&/2) 
is true, then /2 (not containing x) is true, and 

MH/2 ^ F{7))- 

Axiom 3 is applicable because a), b) and c) are true. By Axiom 2, 

M|hP(l)/e,...,M|hP(fc)/e 

M\\=\/xF{x)/e 



for arbitrarily high e < 1. 
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10 Is This Axiom System Complete? 

There are at least two natural definitions of completeness of axiom systems for 
formula learning with belief levels. 

Definition 14. We say that axiom system f2 is absolutely complete for formula 
learning with belief levels iff for every correct BFL L , every model M and every 
enumeration of elementary facts D S Dm there holds: if f € L{D), then f is 
e-provahle by f2 in the model M . 

It is easy to see that there do not exist absolutely complete axiom systems 
for formula learning with belief levels. We referred in our Section Q Theorem 24 
from [RFS97] asserting that no best correct BFL machine is possible. It is easy 
to observe that arbitrary axiom system for formula learning with belief levels can 
be simulated by a BFL machine such that the machine learns all the formulas 
e-provable via this axiom system. An absolutely complete axiom system would 
generate a best correct BFL machine which is impossible because of Theorem 
24 inmm- 

Definition 15. We say that axiom system fl is practically complete for formula 
learning with belief levels if every formula which is e-learnable by some BFL , is 
also e-provable by the axiom system f2. 

We conjecture that our axiom system {Axiomi, Axiom 2 , Axiom^} is practi- 
cally complete. Theorems |HI and |H| (compared with Theorems |3 and in a sense 

support our conjecture. 

We conclude with an open problem the solution of which would clarify much 
(if not everything) for the solution of our conjecture. 

Open Problem. Let two formulas fi and /2 be provably equivalent in the clas- 
sical logic. Let fl be e-provable in our axiom system with belief level converging 
to 1. Is the other formula /2 also e-provable in our axiom system with belief level 
converging to 1? 

We used in the proof of our Theorem El the fact that provably equivalent 
formulas can have very much different maximal belief levels. However this leaves 
open possibility that our Open Problem can have a positive solution. 
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In this talk we present fast algorithms that find approximate solutions for 
a general class of problems, which are called fractional packing and covering 
problems. We describe two main lines of research: 

— algorithms by Plotkin, Shmoys and Tardos j2], 

— algorithms by Grigoriadis, Khachiyan, Porkolab and Villavicencio m 
and new developments. 

Furthermore, we discuss several applications of these techniques: scheduling 
on unrelated machines m. scheduling of parallel tasks 0, bin packing, strip 
packing preemptive resource constrained scheduling and fractional graph 
coloring [^. 
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Abstract. We consider a few problems connected to the commutation 
of languages, in particular finite ones. The goal is to emphasize the chal- 
lenging nature of such simply formulated problems. In doing so we give a 
survey of results achieved during the last few years, restate several open 
problems and illustrate some approaches to attack such problems by two 
simple constructions. 



1 Introduction 

The commutation equation 

XY = YX (1.1) 

is among the most fundamental equations in any algebraic structure. In the case 
of free monoids, that is to say of words, its solution is well known and folklore: 
two words commute if and only if they are powers of a common word. In the 
monoid of languages over a finite alphabet, even if the languages are only finite, 
the situation changes drastically: it is not likely that a complete solution can be 
found. This view is supported by the fact that the equation have solutions like, 
cf. 0, 

X = a + ab+ ba + bb and Y = X + X^ + bab + bbb 



and 

X = a + aa + aaa + ab + aba + b + ba and T = A \ {aa}. 

Moreover, from Theorem 3 it follows that the computational complexity of such 
solutions can be extremely high. 

However, if we fix X and ask Y to be the maximal (but not finite) solution 
of (1) the situation changes essentially. Indeed, it was asked more than 30 years 
ago by Conway whether such a solution is always rational for a given rational 
language X, cf. 0. Obviously, such a maximal language, referred to as the 
centralizer of X, is unique. Thus we formulate: 
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Conway’s Problem. Is the centralizer of a rational language rational, as 
well? 



Amazingly this problem is still unanswered, or in fact it is not even known 
whether the centralizer of a rational X is recursive. The restriction to a finite X 
does not seem to help to answer these questions. One of the goals of this paper 
is to summarize the special cases when the Conway’s Problem is known to have 
an affirmative answer. 

An interesting case when the solutions of the commutation equation is com- 
pletely known is that of polynomials (or formal power series) over a field and 
with noncommuting unknowns. In terms of languages this includes a case of lan- 
guages viewed as multisets. The characterization proved by Bergman, cf. Q or 
ca, is as follows: two polynomials p, q € K < A > commute if and only if there 
exist a polynomial r G K < A > and polynomials h, k G K < b > such that 

p = h{r) and q=k{r), 

where K < A > denotes the polynomials over K with A as the set of unknowns, 
and K < b > those with the single unknown b. This motivates us to state another 
problem: 



BTC-Problem. Given a family T of languages. Does the following hold for 
all A, Y gT-. 

XY = YX 



if and only if there exist a set V and sets of indices I and J such that 
X = UieiV^ and Y = UjejVY 



( 1 . 2 ) 



Intuitively, the above asks when the commutation of languages is charac- 
terized by a very similar condition than that for words. As we mentioned the 
polynomials have such a characterization - this is the source of the abbreviation: 
Bergman Typy Characterization. In many cases we require in the BTC-problem 
only that X is in T and Y is arbitrary. 

The examples given at the beginning of this section show that the BTC- 
property does not hold for all finite languages, it even does not hold when X 
contains only four words. The second goal of this paper is to summarize when 
the BTC-property is known to hold. 

Finally, as the third goal we discuss two simple approaches to attack the 
above problems. 

2 Preliminaries and a Basic Lemma 

Let A be a finite alphabet. We denote by E* (resp. A+) the free monoid (resp. 
free semigroup) generated by S. The identity of S* is denoted by 1, and referred 
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to as the empty word. Elements (resp. subsets) of E* are called words (resp. 
languages). We say that a word u is a prefix (resp. suffix) of a word w if we 
can write w = ut (resp. w = tu) for some word t. By u~^w (resp. wu~^) we 
denote the left (resp. right) quotient of w by u, and these notions are extended 
to languages in a natural way. For two words u and v the word u A v is their 
maximal common prefix. The set of all prefixes (resp. suffixes) of a language 
X C If* is denoted by Pref X (resp. Suf X). Preffc(X) denotes the set of all 
prefixes of length k of words in X . 

Our basic problem is when certain type of languages X, Y C E* satisfy the 
commutation equation 

XY = YX. 

The languages we consider are either finite, rational or of certain types of codes. 
For a finite set X we denote its cardinality by card X. We recall that a code 
C C If* is a set satisfying the implication: 

Xi . . .Xn = yi ■ ■ - ym with Xi, yj G C 

implies 

n = m and Xi = yi for i = 1, . . . , n. 

Particular cases of codes are those of prefix codes, i.e. sets where no word is a 
proper prefix of another, and uj-codes, i.e. sets C satisfying the above implication 
for n = m = oo. It is well known that the monoid of prefix codes under the 
operation of the product is free, cf. while the same does not hold for codes 
or tj-codes. 

For a language X C If* we defined its centralizer Z(X), or Z in brief, as 
the maximal set commuting with X. Clearly, Z is the union of all languages 
commuting with X, and moreover Z^ C Z, in other words, Z is a semigroup. 
Actually, to be precise we can define the centralizer as the maximal monoid or 
as the maximal semigroup commuting with X. Interestingly, these two notions 
are not related, at least in a trivial way. Indeed, for the X in the first example of 
Introduction the monoid and the semigroup centralizers are different modulo the 
empty word, namely {a, b}* and {a, &}^ \ {&}, respectively. In our subsequent 
considerations the centralizer means the semigroup centralizer. Consequently, it 
is natural to assume that X C If+. 

Let X C If+ be a language and Z its centralizer. Then clearly X”Z = ZX^ 
for all n > 0, and therefore we obtain simple approximations for the centralizer: 



Lemma 1. For any X C If+ its centralizer satisfies 

X+ CZ C Pref X* n Suf X*. (2.3) 

Moreover, «/card(X) < oo, then for each z € Z and w G X“ the word xw is in 
X“. 

The latter sentence follows from the first one by the Konig’s Lemma. 



18 



J. Karhumaki 



The approximations of Z in (3) deserve a few comments. First they are 
very easy to deduce. However, to decide whether or not C is an open 
problem, even in the case when card(X) < oo. On the other hand, to test 
whether the second inclusion is proper can be done by checking whether or not 
X(Pref X* n Suf X*) = (Pref X* fl Suf X*) • X - due to the maximality of Z. 
In the case of rational X this can be tested. 

In general, to decide whether two languages commute is computationally 
difficult. Indeed, we have 

Proposition 1 0. Given a context-free language L and a two-element set 
X, it is undecidable whether L and X commute, i.e. LX = XL. 

The proof is based on Theorem 3. 

3 A Survey on Conway’s Problem 

In this section we give a survey when Conway’s Problem is known to have an 
affirmative answer. Despite of being a natural question on computationally very 
simple languages it is not likely to be an easy problem. This is illustrated by the 
following surprising open problem. 

Problem 1. Is the centralizer of a finite set resursive? 

It is even more confusing to note that for concrete examples, even for 
concrete rational languages, the centralizer can normally be found quite easily, 
cf. Section 5. Still we only know that the centralizer is in co-Re - and hence 
Problem 1 is equivalent to a question whether the centralizer is recursively 
enumerable, cf. HD. 

As positive results we have: 

Theorem 1. Conway’s Problem is known to have an affirmative answer in the 
following cases: 

(i) X is a rational prefix code, cf. UM; 

(a) card(A) < 2, cf. 

(Hi) card(A) = 3, cf. 

(iv) X is a finite code which is elementary, synchronizing or contains a word of 
length 1, cf. m; 

(v) X is a rational uj -code, cf. m 

The proofs of these results use quite different techniques. Historically the 
oldest result is that of (i). Its proof is nothing but a combinatorial analysis of 
words, but not at all trivial. It is a challenge to find a simple proof for this nice 
result. Actually, even in the case of biprefix sets a simple proof is not obvious. 
Note also that, although the monoid of prefix sets is free and thus the tools used 
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in the word case are available, the diflicultiness remains since the problem asks 
the commutation of X with arbitrary sets. 

Problems (ii) and (iv) are solved by using Lemma 1 and so-called graph 
lemma on combinatorics on words. These proofs are not difficult, and in fact we 
give in Section 5 even a simpler solution to (ii). 

Problem (iii) is solved by a complicated analysis of certain language equations 
of very restricted types. Finally, to solve Problem (v) again a new method is 
used, namely unambiguous formal power series, and hence results of Bergman 
and Cohn are employed, cf. P and 0. 

It is also worth mentioning that in most cases when the answer to Conway’s 
Problem is known, the centralizer is very simple rational language: in cases (i), 

(ii), (iv) and (v) it coincides with (unless in case (ii) it is even simpler, 
namely C*" for some word t). 

As natural open problems we state: 

Problem 2. Does Conway’s Problem have an affirmative answer for all finite 
sets XI 

Problem 3. Does Conway’s Problem have an affirmative answer for all codes 
XI 



In Problem 3, even the case when card A = 4 is open. 

4 A Survey on the BTC-Problem 

This section is devoted to the BTC-problem, that is to say to search for families 
of languages for which the commutation of languages can be described by a word 
type condition (3) of Section 1. Note that in all these cases the assumption is 
that only X is from some family, and not that both X and Y are from the 
same family. Results are parallel to those - with one exception - presented in 
Theorem 1. 

Theorem 2. The BTC-property holds in the following cases: 

(i) X is a prefix set, cf. 1771/, • 

(ii) card A < 2, cf Jf; 

(iii) card A = 3 and X is a code, cf. m; 

(iv) X is a code which is elementary, synchronizing or contains a word of length 
1; cf 0/; 

(v) X is an uj-code, cf. m 

In cases (i), (ii), (iv) and (v) the proofs are based on the fact that the cen- 
tralizer of A is A+ and the observation of uni that for codes this fact implies 
the validity of the BTC-property. Consequently, only in case (iii) we have a 
slightly different result than the corresponding one in Theorem 1: here we need 
the assumption that A is also a code. The proof is not very easy. 
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As in the previous section we have some natural open questions: 

Problem 4. Does the BTC-property hold for all finite codes? 

Problem 5. Does the BTC-property hold in the case when card A = 3? 

It follows from the example in Introduction that the bound for card X in 
Problem 5 would be optimal. Actually, we conjecture that the answer to Problem 
5 is positive, however, to prove it seems to require complicated combinatorial 
considerations. 



5 Two Technical Approaches 

In this section we introduce two methods to attack our commutation problems. 

Let X C A+ be finite with card E > 2. We say that a word w G Pref A+ is 
branching if w pref A C Pref A"*" and that w is critical if it is branching and not 
in A+. We denote these sets by B and C, respectively. Obviously every word in 
A+ is branching. Moreover we have: 

Lemma 2. Let X C be finite. Then the centralizer Z of X is A’*' if and 
only if C = %. 

Proof. Since the centralizer is a semigroup, for any element z in Z and any x 
in A, the word zx is in the centralizer, and hence by Lemma 1, in Pref A"'". 
Consequently, any element of the centralizer is a branching point. So the result 
follows from the definition of critical points. □ 

Lemma 2 provides a very simple proof for the case (ii) in Theorems 1 and 2: 

Theorem 3. Let X C A+ with card A = 2. Then the centralizer of X is X~^ 
or t~^ , where t (if exists) is the minimal word such that X Q t* , and any set 
commuting with X is either of the form IJ j A* for some / C N, or of the form 
for some / C N, respectively . 

Proof. If there exists a word t such that X C t* the result is a simple combi- 
natorial application of Lemma 1. In the other case we know that A = {x, y} 
satisfies the condition xy yx, cf. |2]. Consequently, the set 

A' = {xy A yx)~^ X{xy A yx) 

is marked, i.e. the words start with different letters. Further, for any A C aE* 
and any Y 



XY = YX if and only if a ^Xaa ^Ya = a ^Yaa ^Xa, 



implying that 



Z{a~^Xa) = a~^Z{X)a. 
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It follows that 

Z(X') = {xy A yx)~'^ Z{X){xy A yx). 

But X' is marked so that, by Lemma 2, Z{X') = X'^ . Therefore also Z{X) = 
X+. 

Now the BTC property follows from an observation in cf. also . n 

We believe that this approach can be extended to give a simpler proof for the 
nice results of H3|> that is case (i) in Theorems 2 and 3, and for all three-element 
sets. However, this seems to lead, especially in the latter case, to complicated 
considerations. 

As the second approach we introduce a method to define the centralizer as 
the maximal fixed point of a mapping, cf. jO] and HD!. 

Let X C A+ be an arbitrary language. We define recursively 

Xo = Pref A+ n Suf X+ 

and 

Xi+i= X,\{X~'^{XXiAX^X)Ll{XX^AXiX)X-'^), for i > 0, 
where A denotes the symmetric difference of languages. Finally, we set 

^0 = n 

i>0 

Then it is not difficult to show that 

(i) Xi^i C Xi for all i > 0, 

(ii) Xi D Z for all i > 0, 

(iii) XZo = ZoX. 

Consequently, we obtain 

Theorem 4. The language Zq is the centralizer of X, that is Zq = Z(X). 

Note that, by construction, Zq is the maximal fixed point of the mapping 

^■.Y > — >Y\ {X~\YXAXY) + {YXAXY)X-^). 

Unfortunately, however, this approach might give the centralizer only as the 
limit, and therefore does not allow a solution to Conway’s Problem, although all 
Xi’s are rational whenever X is so. 

On the other hand, in practice this method seems to be a way to compute the 
centralizer of a rational language. Experiments show that it typically converges 
in a finite number of steps to the centralizer, which, moreover, is of one of the 
following forms: 

(i) 

(ii) S' = {w e E+\wX C XX+ and Xw C XX+}; 

(iii) Zq = Ai, but different from A+ and S. 

In all experiments the centralizer has turned out to be rather easily definable 
rational language. 
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6 Conclusions 

We have discussed two problems connected to the commutation of languages, 
and in particular to that of finite languages. Our goal was to point out that there 
are very challenging and simply formulated problems on this field. 

The following two general observations can be seen as partial explanations of 
the difficultiness of these problems. First the commutation requirement, that is 
the fact that a word z is in the centralizer only if two specified local operations 
X- and ■y~^ together lead to another element of Z, resembles a computational 
step in tag systems of Post, cf. ng. Second, the requirement is that, for all x in 
X, there must exist an y G X such the xzy~^ is again in Z. In other words, we 
have a problem of V3-type, and such problems are in many cases known to be 
difficult. 

Despite of the above we would guess that the answer to Conway’s Problem 
is affirmative - at least for finite X. Consequently, this problem would be a 
splended example of problems, which define something “simple”, but to prove 
this seems to be very difficult. Of course, there exist similar “jewels” in the 
theory of rational languages. For example, the nonconstructive rationality of 
an equality set of two prefix morphisms is such an example, cf. |E|. Another 
example is Higman’s theorem stating that for any language its upwards closure 
under the partial ordering of “being a scattered factor” is rational, cf. 0 or |2|. 
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Abstract. We present some of the recent results on computational 
complexity of approximating bounded degree combinatorial optimiza- 
tion problems. In particular, we present the best up to now known ex- 
plicit nonapproximability bounds on the very small degree optimization 
problems which are of particular importance on the intermediate stages 
of proving approximation hardness of some other generic optimization 
problems. 



1 Introduction 

An interesting approximation hardness phenomenon of combinatorial optimiza- 
tion was discovered in FyM| and |ALk^^92) . to the effect that the bounded 
degree instances of several optimization problems are hard to approximate to 
within an arbitrary constant. This fact seemed to be a bit puzzling at the time 
as bounded degree instances of many optimization problems were known to 
have trivial approximation algorithms dramatically improving performances of 
the best known approximation algorithms on general instances. An interesting 
artifact on their complementary, i. e. dense, instances was also the existence of 
polynomial time approximation schemes (PTASs) for them |A K K9,'i| . [KZ97j . 
see EH for a survey. We discuss here explicit approximation lower bounds for 
bounded degree instances with a very small bounds on degrees like 3 or 4, and 
also the best known approximation algorithms on that instances. These instances 
have turned out to be particularly important at the intermediate reduction stages 
for proving hardness of approximation of some other important optimization 
problems, like Set Cover, some restricted versions of Traveling Salesman Prob- 
lem, and the problem of Sorting by Reversals motivated recently by molecular 
biology, cf. Ena, iFYTin . iFran . mm . if™ , eh, e™ . We mention 
here some interesting new results on asymptotic relations between hardness of 
approximation and bounds on a degree of instances IRUUI . 141)11 . These results do 
not yield though explicit lower approximation bounds for small degree instances 
needed in applications mentioned before. 
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We survey in this paper the best known up to now explicit approximation 
lower bounds for the small degree (number of variable occurrences) optimiza- 
tion problems, like the problems of maximization or minimization of the sat- 
isfiability of systems of linear equations mod 2, MAX-CUT, MAX- and MIN- 
BISECTION, MAX-2SAT, MAXIMUM INDEPENDENT SET, and MINIMUM 
NODE COVER |BK99| . |BK()Ibj . We move on, and apply these results to get 
explicit lower approximation bounds for the problem of Sorting by Reversals 
lEKHDI, and the Traveling Salesman Problem with distances one and two lEKnn|. 
Finally, we mention recent improvement on approximation ratios of algorithms 
for small degree MAX-CUT and MAX-BISECTION problems based on local en- 
hancing methods for semidefinite programming IKKLOOaj . [EKLOObj . jKKmT] . 

2 Bounded Degree Maximization Problems 

We are going to define basic optimization problems of this section. 

— MAX-Efc-LIN2: Given a set of equations mod 2 with exactly k variables 
per equation, construct an assignment maximizing the number of equations 
satisfied. 

— 6-OCC-MAX-EA:-LIN2: Given a set of equations mod 2 with exactly k vari- 
ables per equation and the number of occurrences of each variable bounded 
by b, construct an assignment maximizing the number of equations satisfied. 

— 6-OCC-MAX-HYBRID-LIN2: Given a set of equations mod 2 with exactly 
two or three variables per equation, and the number of occurrences of each 
variable bounded by b, construct an assignment maximizing the number of 
equations satisfied. 

— AOCC-MAX-2SAT: Given a conjunctive normal form formula with two vari- 
ables per clause, construct an assignment maximizing the number of clauses 
satisfied. 

— d-MAX-CUT : Given an undirected graph of degree bounded by d, partition 
its vertices into two groups so as to maximize the number of edges with 
exactly one endpoint in each group. 

— d-MIS: Given an undirected graph of degree bounded by d, construct a maxi- 
mum cardinality subset of vertices such that no two vertices of it are adjacent. 

We are going to display now approximation preserving reductions which re- 
duce from the MAX-E2-LIN2 and the MAX-E3-LIN2 problems. The method 
of reductions depends on a new wheel- amplifier construction of Berman and 
Karpinski designed specially for bounded degree problems. This kind 

of amplifier has turned out to be more efficient than the standard expander 
amplifiers (cf. e.g., Arora and Lund IAL97I 1 for small degree, and number of 
occurrences, optimization problems. 

We start with the following known inapproximability results of Hastad fHTFTj . 

Theorem 1. (B) For any 0 < e < it is NP-hard to decide whether 
an instance of MAX-E2-LIN2 with 16n equations has its optimum value above 
(12 — e)n or below (11 -|- e). 
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Theorem 2. (m ) For any 0 < e < it is NP-hard to decide whether 
an instance of MAX-E3-LIN2 with 2n equations has its optimum value above 
(2 — e)n or below (1 + e)n. 

In Berman and Karpinski |BKhh| the following polynomial time randomized 
approximation preserving reductions were constructed: 

- /i : MAX-E2-LIN2 ^ 3-OCC-MAX-E2-LIN2, 

- /2 : MAX-E2-LIN2 ^ 3-MAX-CUT, 

- /a : MAX-E3-LIN2 ^ 3-OCC-MAX-HYBRID-LIN2, 

The constructions for fi, and /2 use variants of wheel- amplifier methods, 
whereas a construction for /a uses certain 3-hypergraph extension of it. The 
following optimizing properties of fi, f 2 , and /a were proven in [BK99j . 

Theorem 3. ft B K9{jf ) For any 0 < e < it is NP-hard to decide whether an 
instance of /i(MAX-E2-LIN2) G 3-OCC-MAX-E2-LIN2 with 336 edges has its 
optimum value above (332 — e)n or below (331 -I- e)n. 

A similar result can be proven for / 2 , and the 3-MAX-CUT-problem. 

Theorem 4. f\ B K9 !^ ) For any 0 < e < it is NP-hard to decide whether an 
instance of /2(MAX-E2-LIN2) G 3-MAX-CUT with 336 edges has its optimum 
value above (332 — e)n or below (331 -I- e)n. 

For /a and MAX-HYBRID-LIN2 we have 

Theorem 5. For any 0 < e < it is NP-hard to decide whethr an 

instance of /a(MAX-E3-LIN2) G 3-OCC-MAX-HYBRID-LIN2 with 60n equa- 
tions with exactly two variables and 2n equations with exactly three variables 
has its optimum value above (62 — e)n or below (61 -I- e)n. 

Theorem 4 can be also used to derive the following bound for 3-OCC-MAX- 
2SAT. 

Theorem 6. ( WKfif^l ) For any 0 < e < it is NP-hard to decide whether 
an instance of 3-OCC-MAX-2SAT, with 2016n clauses has its optimum above 
(2012 — e)n or below (2011 -I- e)n. 

The 3-OCC-MAX-HYBRID-LIN2 problem and Theorem 5 can be used to 
derive lower bounds for 4-MIS problem, and using some more subtle construction, 
even for 3-MIS problem. 

Theorem 7. (\BK99f ) For any 0 < e < it is NP-hard to decide whether 
an instance of 4-MIS with 152n nodes has its optimum value above (74 — e)n 
or below (73 -I- e)n, and whether an instance of 3-MIS with 284n nodes has its 
optimum value above (140 — e)n or below (139 -I- e)n. 



The results above imply the following explicit nonapproximability results. 
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Corollary 1. For every e > 0, it is NP-hard to approximate: 

(1) 3-OCC-MAX-E2-LIN2 and 3-MAX-CUT to within a factor 332/331 - e, 

(2) 3-OCC-MAX-HYBRID-LIN2 to within a factor 62/61 - e, 

(3) 3-OCC-MAX-2SAT to within a factor 2012/2011 - e, 

(4) 4 .-MIS to within a factor 74:/7S — e, and 3-MIS to within a factor 140/139 — e. 



The best to our current knowledge gaps between upper and lower approx- 
imation bounds are summarized in Table 1. The upper approximation bounds 
are from !CW94| . fRF^ . (EEaH!, |FC^ . IFKLOOaj .The technical results of this 
section will be used also later on in our paper. 



Table 1. Bounded Degree Maximization Problems 



Problem 


Approx. Upper 


Approx. Lower 


3-OCC-MAX-E2-LIN2 


1.0858 


1.0030 


3-OCC-MAX-HYBRID-LIN2 


2 


1.0163 


3-MAX-CUT 


1.0858 


1.0030 


3-OCC-MAX-2SAT 


1.0741 


1.0005 


3-MIS 


1.2 


1.0071 


4-MIS 


1.4 


1.0136 



3 Bounded Degree Minimization Problems 

We are going to introduce now the following minimization problems. 

— d-Node Cover: Given an undirected graph of degree bounded by d, construct 
a minimum cardinality subset of vertices such that each edge of a graph has 
hat least one of its endpoints in it. 

— MIN-EA:-LIN2: Given a set of equations mod 2 with exactly k variables 
per equation, construct an assignment minimizing the number of equations 
satisfied. 

— &-OCC-MIN-Efc-LIN2: Given a set of equations mod 2 with exactly k vari- 
ables per equation and the number of occurrences of each variable exactly 
equal to b, construct an assignment minimizing the number of equations 
satisfied. 

— MIN-BISECTION: Given an undirected graph, partition the vertices into 
two equal halves so as to minimize the number of edges with exactly one 
endpoint in each half. 
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— d-MIN-BISECTION: Given a d-regular graph, partition the vertices into two 
equal halves so as to minimize the number of edges with exactly one endpoint 
in each half. 

We will specialize now techniques of Section 2 to obtain lower approximation 
bounds on bounded degree minimization problems. 

We start with a direct application of Theorem 7 towards d-Node Cover prob- 
lem. For a given undirected graph G = (V,E), and a maximum independent 
set I of G, V\I is a minimum node cover of G. We take now an instance of 
4-MIS with 152n nodes. It is NP-hard, for any 0 < e < to decide whether 
4-Node Cover has its optimum value above (152 — 73 — e)n = (79 — e)n or below 
(152 - 74-1- e)n = (78 -I- e)n. Similarly for 3-Node Cover. Thus we have 

Theorem 8. For any 0 < e < it is NP-hard to decide whether an instance of 
4-Node Cover with 152n nodes has its optimum value above (79 — e)n or below 
(78 -I- e)n, and whether an instance of 3-Node Cover with 284n has its optimum 
value above (145 — e)n or below (144 -|- e)n. 

Theorem 8 gives the following approximation lower bounds for 4-Node Cover 
and 3-Node Cover problems. 

Corollary 2. For every e > 0, it is NP-hard to approximate 

1. 3-Node Cover to within a factor 145/144 — e, 

2. 4~Node Cover to within a factor 79/78 — e. 

We turn now to the bounded occurrence minimum satisfiability of linear 
equations. 

We need the following recent result of Dinur, Kindler, Raz and Safra 
(see also 113123). 

Theorem 9. ffDKRSOO^ ) MIN-LIN2 is NP-hard to approximate to within a 
factor for some constant c. 

MIN-LIN2 is equivalent to the well known Nearest Codeword problem (cf. 
IIAljSS93] l. Only very recently the first sublinear approximation ratio 0{n/logn) 
algorithm was designed by Berman and Karpinski IRKOibl . 

We introduce now a notion of an (r, t)-approximation algorithm. For two 
functions r and t, we call an approximation algorithm A for an optimization 
problem P, an (r(n), t(n))-approximation algorithm if A approximates P within 
an approximation ratio r(n) and A works in 0{t{n)) time for n a size of an 
instance. 

Berman and Karpinski fBKOlbj proved the following result on the (r, t)- 
approximations of the 3-OCC-MIN-E3-LIN2 problem. 

Theorem 10. flBKOlbf ) There exists a constant c such that if there exists an 
(r(n), t(n))-approximation algorithm for 3-OCC-MIN-E3-LIN2, then there exists 
an (r(cn), t(cn))-approximation algorithm for MIN-LIN2. 
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Theorem 9 entails now 

Theorem 11. The problem 3-OCC-E3-LIN2 is NP-hard to approximate to 
within a factor n'^/^ogiogn some constant c. 

The 3-OCC-MIN-E3-LIN2 problem is equivalent to the exactly-3 bounded 
occurrence 3-ary Nearest Codeword problem (c.f. KSIHZI), and therefore we 
have 

Corollary 3. The 3-ary Nearest Codeword problem with the number of occur- 
rences of each variable exactly equal to 3 is NP-hard to approximate to within 
a factor j^c/Zogiogn gonie constant c. 

We apply a similar technique for the problem of MIN-BISECTION. Here our 
result will be only relative to the approximation hardness of MIN-BISECTION, 
the status of which is wide open, and we know currently of no proof technique 
which excludes existence of a PTAS for that problem. 

Somewhat surprisingly in that context, Berman and Karpinski IRKOibl 
proved the following result on approximation hardness of bounded degree version 
of MIN-BISECTION. 

Theorem 12. ( jBKOlhf ) If there exists an (r(n), t(n))-approximation algorithm 
for 3-MIN-BISECTION, then there exists an (r(n^), t(n^))- approximation al- 
gorithm for MIN-BISECTION. 

The best currently known approximation algorithm for the MIN- 
BISECTION is 0{log^n) due to Feige and Krauthgamer [KKOOj . Any asymptotic 
improvement on approximation ratio r for 3-regular graphs, say r = o{log^n), 
will entail, by Theorem 11, an improvement on an approximation ratio for the 
general MIN-BISECTION. 

A similar technique can be also used to prove approximation hardness re- 
sult for the general planar MIN-BISECTION of the planar MIN-BISECTION 
problem on 3-regular graphs. 

4 Some Application 

We are going to apply our previous results for some other generic optimization 
problems. The first problem is one of the most important problems in analy- 
sis of genome rearrangements, and it is being recently also motivated by other 
algorithmic problems of computational molecular biology. 

— MIN-SBR (Sorting by Reversals): Given a permutation, construct a mini- 
mum length sequence of reversals (see for definitions EEHEI) which trans- 
forms it to the identity permutation. 

We refer also to some other variants of Sorting by Reversals problems studied 
in j( '99j . called MSBR and Tree SBR (see the definitions there). 
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The proof technique used in [HKOOj to prove explicit approximation lower 
bound of Theorem 7 for 4-MIS can be adapted to prove for the first time the in- 
approximability of MIN-SBR, and, in fact, also giving an explicit approximation 
lower bound for that problem. 

Theorem 13. fWK9!^ ) For every e > 0, it is NP-hard to approximate MIN- 
SBR within a factor 1237/1236 — e. 

Caprara pH?) has used the above result to prove inapproximability of the 
both beforementioned problems, MSBR, and Tree SBR, and to compute the first 
explicit approximation lower bounds for those problems. 

We turn now to another application of the results of Section 2. We denote 
by (1,2)-TSP the Traveling Salesman Problem with distances one and two, and 
its asymmetric version by (1,2)-ATSP (cf. [F YD3] . [V!I2] ). 

Engebretsen and Karpinski fEKOOj has used recently the result on 3-OCC- 
MAX-HYBRID-LIN2 of Theorem 5 to prove the following explicit inapproxima- 
bility result for (1,2)-ATSP problem. 

Theorem 14. ffEKOO^ ) For every e > 0, it is NP-hard to approximate (1,2)- 
ATSP within a factor 321/320 — e. 

The construction used by Engebretsen and Karpinski [EKOOj could be also 
adapted to yield an explicit result for (1,2)-TSP. 

Theorem 15. ( \EK(}[^ ) For every e > 0, it is NP-hard to approximate (1,2)- 
TSP within a factor 743/742 — e. 

5 New Upper Approximation Bounds 

The intricacy of proving the first explicit approximation lower bounds for small 
degree optimization problems, and the resulting huge gaps between upper and 
lower approximation bounds has stimulated research on improving approxima- 
tion ratios for those problems as well as for some other generic problems. 

The first gap for 3-MAX-CUT (and 3-OCC-MAX-E2-LIN2) was improved 
recently by Feige, Karpinski and Langberg F^bnnaj . see Table 1. The technique 
of [FKlRflaj is based on a new local enhancing method for semidefinite programs. 

Theorem 16. f jFKLOOd^ ) There exists a polynomial time algorithm approxi- 
mating 3-MAX-CUT within a factor 1.0858. 

We note that the best approximation ratio currently known for MAX-CUT 
problem on general graphs is 1.1383 (' |CW94j l. and the best known approxima- 
tion lower bound is 1.0624 We note also that for the semidefinite relaxation 
of MAX-CUT used in |CW94j . the integrality gap is at least 1.1312, even for 
2-regular graphs. Thus the bound of Theorem 16 beats the integrality bound 
even for 2-regular graphs. 

We turn now to the special case of regular bounded degree graphs, and will 
investigate approximation algorithms for the MAX-CUT and MAX-BISECTION 
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(partitioning of a graph into two halves so as to maximize a number of the edges 
between them). 

Rd-MAX-CUT and Rd-MAX-BISECTION are the MAX-CUT and MAX- 
BISECTION problems, respectively, restricted to d-regular graphs. 

Feige, Karpinski and Langberg |FKIjflflaj . [EKLOObj were able to improve 
the best known approximation ratios for both bounded degree problems, Rd- 
MAX-CUT, and Rd-MAX-BISECTION. The best known approximation ratio 
for MAX-BISECTION on general graphs is 1.4266 |HZ00j . 

Theorem 17. (]FKL00<^ . IFKLQO^ ) There are polynomial time algorithms 
that approximate R3-MAX-CUT and R3-MAX-BISECTION problems within 
factor 1.0823 and 1.1991, respectively. 

Using an additional local adhancement method, Karpinski, Kowaluk and 
Lingas IIKKLOO^ . have further improved approximation ratios of the low degree 
Rd-MAX-BISECTION problems. 

Theorem 18. (\KKL0(}'j ) There exists a polynomial time algorithm approxi- 
mating R3-MAX-BISECTION within a factor 1.1806. 

Interestingly, the first improvements on approximation ratios of MAX- 
BISECTION on low degree planar graphs undertaken in [KKbOO] has lead to 
design of the first PTASs for the general planar MAX-BISECTION as well as 
for other geometrically defined classes of graphs (see [UKbSOlj ) . 

On the lower bounds side, we note that the techniques of yield also 

the best up to now explicit approximation lower bounds for R3-MAX-CUT, and 
R3-MAX-BISECTION problems equal to the lower approximation bound for 
3-MAX-CUT problem of Section 2. 

6 Summary of Approximation Results on Bonnded 
Degree Minimization Problems 

We present here (Table 2) the results of Section 3 and 4 on bounded degree mini- 
mization problems and the best to our knowledge gaps between upper and lower 
approximation bounds on those problems. The upper approximation bounds are 
from lEEni, lEEng, IBKOlb) . [FKQQ] . [BHKQl! . 

7 Open Problems and Fnrther Research 

An obvious open problem is to improve on both the lower and upper approx- 
imation bounds of bounded degree optimization problems, especially on those 
with the very small degree bounds. The essential improvements on the explicit 
lower bounds for these problems might be of paramount difficulty though, but 
same time they are also of great interest. Any such improvement would have im- 
mediate effects on the explicit lower bounds for other optimization problems, as 
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Table 2. Bounded Degree and Weight Minimization Problems 



Problem 


Approx. Upper 


Approx. Lower 


3-Node Cover 


1.1666 


1.0069 


4-Node Cover 


1.2857 


1.0128 


3-OCC-MIN-E3-LIN2 


0(n/logn) 


^n(l)/loglogn 


3-MIN-BISECTION 


O(log^n) 


Equal to MIN-BISECTION 


MIN-SBR 


1.375 


1.0008 


(1,2)-TSP 


1.1667 


1.0013 


(1,2)-ATSP 


1.4167 


1.0031 



indicated in this paper. Perhaps somewhat easier undertaking would be improv- 
ing on upper approximation bounds. Here essential improvements were already 
achieved on the problems like a small degree MAX-CUT, and MAX-BISECTION 
mentioned in Section 5. How about improvements on other bounded degree op- 
timization problems? 
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Abstract. This paper considers interrelations between universal alge- 
bra, algebraic logic, geometry and computer science. The key idea of the 
paper is to show that problems, coming from computer science, require 
introducing of highly non-trivial mathematical structures. On the other 
hand, algebraic models in computer science give deeper understanding 
of problems essence. 

This general idea is illustrated on the example of knowledge bases. Theo- 
rems concerning the knowledge base equivalence problem are formulated. 



1 Introduction 

Universal algebra had been started from the clear desire to illuminate the com- 
mon core of various classical algebraic structures, that is to distinguish the com- 
mon features of groups, semigroups, rings, and others. The development of uni- 
versal algebra gave rise also to its own new problems, which have been inspired 
by the ties with logic and category theory. 

At the meantime the progress of universal algebra is determined, to great 
extent, by its computer science ties. In particular, computer science applications 
motivate the use of operations with arbitrary arity and a special attention is paid 
on multysorted algebras. Computer science also extends the application area of 
algebraic logic. 

Algebraic models in computer science are intended to give better explanation 
of the essence of the problems. This general rule works in the database theory, 
in knowledge theory and also in the theory of computations. 

On the other hand, computer science stimulates the appearance of new ideas, 
problems and constructions in universal algebra. This exchange essentially en- 
riches algebra and category theory (see for example P, |2|, P, P, p|, |0|, |E|, 
0 )- 

It should be underlined specially that the interrelations between algebra and 
computer science assume heavy use of advanced algebraic structures, which are 
far from simple algebraic notions. 

These general statements we will illustrate on the example of knowledge 
bases. We will formulate two theorems concerning the knowledge bases equiva- 
lence problem. These theorems produce an algorithm for processing of such an 
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equivalence. They can be used in practice ignoring all algebraic difficulties of 
their proofs 0, |H|- 

In the next section we give all necessary preliminary notions. 



2 Knowledge 

We consider elementary knowledge, i.e., knowledge represented in the First Order 
Logic (FOL). 

We assume that there are three components of knowledge: 

1) The description of the knowledge. 

It is the syntactical part of the knowledge, written out in the language of the 
given logic. The description reflects, what we want to know. In knowledge bases 
(KB) it is a query. 

2) The subject of the knowledge which is an object in the given held of 
application, that is, an object for which we determine knowledge. 

3) Content of the knowledge (its semantics). This is a reply in KBs. 

The first two components are relatively independent, while the third one is 
uniquely determined by the previous two. 

The following example explains this approach. Let i? be a held of real num- 
bers. Consider it as a subject of knowledge. Knowledge is described by a system 
of equations T of n variables with real coefficients. Content of knowledge is the 
locus in the space i?" given by the system of equations T. 

We will consider also a situation when the system T contains not only equal- 
ities, but arbitrary formulas of FOL. In other words, we describe knowledge in 
some logical language and interpret its content as a geometrical image. This is 
the starting point for further considerations. 



3 Algebra 

In the subsequent three sections we give background on algebra, logic and ge- 
ometry. 

Fix a variety of algebras 0, reflecting knowledge type. It is given by a system 
of operations f2 and a system of identities controlling operations action. For 
example, we have a variety of all boolean algebras or a variety of automata. 

An important fact is that every variety contains free algebras. Elements of 
a free algebra W = W{X) for a set X are terms (or 0-terms, or 0-words). 
They are built from the elements of the set X with the help of operations of the 
system fi and the rules of the variety 0. The main property of these W can be 
expressed by formula 
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id - embedding of X in W = W(X) 

V - calculates the value of variables in an arbitrary algebra G & 0 
H - calculates the value of terms in G 

/i is a homomorphism of algebras, extending the mapping /i. It is fully deter- 
mined by id and ly. 

Along with algebras we consider also models. A system of symbols of relations 
<P is fixed. The symbols are realized in the algebras G from 0. In the model 
(G, /) the function / realizes every symbol ip G in the given algebra G. If a 

symbol (/? is a symbol of n-ary relation, then f{ip) is a subset in the space G". 
Further we will regard such functions / as instances of a KB. 

In the sequel the models (G, <P, /) represent subject of knowledge. 

In order to compare KBs we need a notion of multimodel (G, d>, F), where F 
is a set of admissible realizations /. We write KB = KB{G, d>, F). 

A reply to a query to a KB is a subset in some G”, G G 0. Cartesian 
powers G" we consider as affine spaces of points. Elements of G" are points 
a = (oi, . . . , On), tti € G. Thus a reply to a query accepts a geometrical meaning 
and therefore the geometrical approach starts. 

Another presentation of an affine space G”, more suitable in our case, is the 
set Hom{W{X),G) of all homomorphisms p : W{X) -G G for the set X = 
, . . . , 

A bijection 

ax ■■ Hom{W{X),G) -G G” 

is determined by the rule ax{ij) = This rule allows to con- 

sider the set Hom(W(X), G) as an affine space. Now the points are homomor- 
phisms /I : W(X) -G G. 

4 Logic 

We have to deal not with a pure logic but with logic in a given variety 0. Fix 
a variety 0, a set of symbols of relations <P, and a finite set of variables X. 
Consider a set of symbols of logical operations 

Lx = L = {V, A, “ 1 , 3x, X G Xy 

We describe knowledge with the help of a set (algebra) of formulas, denoted by 
L(PW{X), X is finite. The set L<1>W{X) consists of atomic formulas of the form 

w = w', a,nd (p{wi, . . . ,Wn), P>GL>, w,w' , wi, . . . ,Wn G W{X). 

and arbitrary formulas which are built from atomic ones using operations from 
the signature L. 

Along with algebra of formulas, we consider another algebra in the same 
signature L, denoted by Bool{W{X),G). It consists of all subsets of the affine 
space H om{W (X) , G) . Boolean operations V, A, -i are defined in the usual way. 
Let us define quantifiers. For A C Flom{W{X), G) we set 



pL G BxA 3n G A : p{y) = v{y) 'iy ^ x. 
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This definition exactly refiects the geometrical meaning of the existential quan- 
tifier. 

In order to obtain content of knowledge by its description, let us define the no- 
tion of the value of a formula. Denote by V al {u) , u G L^W the value of a formula 
for a model (G, /). Here Val(u) is a subset of an affine space IIom(W(X), G). 

Define the value of atomic formulas: 

1 ) : W —>■ G G Val(w = w') = w'^ in G. 

2 ) fj,GVal{ip{wi,...,Wn)) ^ G f{(p). 

Values of arbitrary formulas are defined by induction, using the rules: 

V al{u\ V U2) = V al{u\) U Val{u2), 

V al{ui A U2) = Val(ui) fl Val{u2), 

V al{3xu) = 3xVal{u). 

Here, for example, ui V U2 is a formal description of knowledge, while Val{u\) U 
V al{u2) is its content. 

It follows from the definitions above, that the value of the formula 
ip{xi, . . . ,Xn) on the model (G, <?, /) determines the instance / of the model, 
namely 

axValf{ip{{xi, Xn)) = /{if)- 

Consider a correspondence between the description of the knowledge and 
its content. Let T be a set of formulas of L<PW, determining description of 
knowledge. Let us define content of knowledge for the subject of knowledge 
(model) (G, <P, /) by the rule 

T^' = Pi Val{u). 

ueT 

From the geometrical point of view the knowledge content is an algebraic 
variety of points in the affine space Hom{W, G), or locus. In the next section we 
will consider geometrical nature of such content of knowledge. 

5 Geometry 

According to the approach above, one can look at knowledge content as at ge- 
ometric object, namely, an algebraic variety in the corresponding affine space. 
Algebraic variety turns to be a geometrical object if we consider it in the frames 
of some special category. Denote by K^e{f) a category of algebraic varieties 
for the given model {G,<P,f). Objects of this category are pairs (X,A), where 
A is an algebraic variety in the space Hom{W,G). Morphisms {X,A) -G (Y,B) 
are given by homomorphisms of algebras s : W(Y) -G W(X). To every s it 
corresponds a map of affine spaces 



s : Hom{W{X),G) -G Hom{W{Y),G) 
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defined by the rule: for the point v : W{X) — G we set s{v) = vs : W{Y) — H. 
In order to define morphisms we take those s for which s{v) £ B for every point 
V G A. These s are considered as morphisms in the category K^Q{f). Such 
morphisms give connections between algebraic varieties. In particular, for every 
object {X, A) morphisms {X, A) — >■ {X, A) are tied with the movement inside 
given A. 

This is the first occurrence of geometry in A. Another dynamics in A relates 
to the group of automorphisms of the model Aut{f), also acting in A and giving 
the second occurrence of geometry. The third one is appeared as the system of 
all subvarieties in the given algebraic variety. 



6 Category of Knowledge 

Category of knowledge is defined for the model (G, /) and is denoted by 

Know 4 , 0 {f). Its objects (knowledge) are denoted by {X,T,A). In addition to 
the content of knowledge A we fix its description T. Morphisms 

s:{X,T,,A)^{Y,T2,B) 

are given like in the category K^o^f). It is proven that they connect not only 
content of knowledge but also its description. 

7 Algebraic Logic 

Construction of algebraic model of knowledge base requires replacement of the 
pure logic by algebraic logic. 

Every logical calculus assumes that there are some definite sets of formulas 
of the calculus, axioms of logic and rules of inference. This is the base to de- 
fine the syntactic equivalence of formulas, well correlated with their semantical 
equivalence. The transition from logic to algebraic logic is grounded on treating 
logical formulas up to such equivalence which gives compressed formulas. 

This transition leads to various special algebraic structures. Boolean algebras, 
associated with the propositional turns to be the free boolean algebra. Similarly, 
Heyting algebras, associated with intuitionistic propositional calculus turn to 
be the free Heyting algebras. However, for FOL the corresponding algebraic 
structures are not free. 

The construction of logical calculus usually assumes some fixed infinite set of 
variables. Let us denote it by A°. It looks better to substitute this universal A° 
by the system of all its finite subsets X. This leads us to multisorted logic and 
multisorted algebraic logic. Every formula here has its sort X. In particular, we 
come to multisorted Halmos algebras, defined further. This gives some new view 
of the algebraization of FOL. Besides, let us note that the logic here is related 
to a fixed variety of algebras 0 (compare 0). We need all this for universal 
algebraic geometry and applications in knowledge science. 
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We fix here some variety of algebras 0. This assumes a signature 17 = 17(0) 
and a system of identities ld{0). 

Further we define Halmos categories for the given 0. 

First we define existential quantifiers for a given Boolean algebra B. These 
are the mappings 3 \ B ^ B with conditions 

1. 30 = 0 

2. a < 3a, a(3a) = a 

3. 3(a A 3&) = 3a A 36, 0,a, 6 gB 
Quantifiers \/ : B ^ B are defined dually: 

1. VI = 1 

2. a > Va 

3. V(a A VS) = Va A V6 

Let now S be a Boolean algebra and X a set. We say that S is a quantorian 
7f-algebra if for every x G X we have a quantifier 3x : B ^ B and for every two 
x,y G X it holds the equality 3x3y = 3y3x. 

One may consider also quantifier X-algebras B with equalities over W{X). 
In such algebras to each pair of elements w,w' G W(X) of it corresponds an 
element w = w' G B satisfying the conditions 

1) w = w is the unit in B 

2) (wi = A ... A Wn = w'fi) < (wi . . . WnOJ = w'^ . . . w'„io) where to is an 
operation in 17. 

Now for the given variety 0 we consider the category 0° of all free in 0 
algebras W = W{X), where X is an arbitrary finite set. Morphisms here are 
homomorphisms in 0. The category 0° plays an important role in equational 
geometry. 

The same role in the geometry in FOL plays the category Hal<fe of formulas 
in FOL. This category Hal<f© are connected with the category 0°. An arbitrary 
Halmos category H for every finite X fixes some quantorian A-algebra with 
equalities H{X). These are objects in H. 

The morphisms in H are connected with morphisms in 0®. For every s : 
W{X) — >■ W{Y)ixi 0° we have a morphism 

s* = s : H{X) -G H{Y). 

These are all morphisms in H . 

We assume that 

1) The transitions W{X) -G H{X) and s — >■ s* constitute a (covariant) 
functor 0° — >■ H. 

2) Every s* : H{X) H{Y) is a Boolean homomorphism. 

3) The coordination with the quantifiers is the following: 

3.1) Si3xa = S23a;a, a G H{X) if s\y = S 2 y for every y G X,y x. 

3.2) s3xa = 3(sx)(sa) it sx = y G Y and y = sx does not lie in the 
support of see', x' G A, x' yf x. 

4) The following conditions describe coordination with equalities 

4.1) s*(w = w') = {sw = sw') for s: W(X) — >• W{Y), w, w' G IV(X). 
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4.2) s^aA {w = w') < s^,a for an arbitrary a £ H{X), x £ X, w,w' £ 
W(X), and s^:W{X) — i W{X) is defined by the rule: s^(x) = w,sy = y,y £ 
X, y^x. 

So, the definition of Halmos category is given. 

Suppose now that all finite X are subsets of some infinite universum X^. 
In this case the Halmos category can be considered as a multisorted Halmos 
algebra. 

Multisorted Halmos algebra is an algebra of the form 

H={H{X),X C X°) 

where every H{X) is an object of the category H with its algebraic structure 
and morphisms of the category are considered to be algebraic operations on the 
algebra H. 

Multisorted Halmos algebras determine a variety which is denoted by Hale- 
The principal examples of Halmos algebras and categories have the form 
Hale{<P) and Hale{G), where is a set of symbols of relations and G is an 
algebra in the variety 0. The algebra Hale{(I>) is the algebra of compressed 
formulas. The algebra Hale{G) is defined by 

Hale(G)(A:) = Bool{W{X),G) 

and for every s : W(X) — >■ W{Y) in 6>° we have 

s = s* : Bool{W{X),G) -£ Bool{W{Y), G), 



in Hale{G). 

For every model {G.<P, /) there is a homomorphism of Halmos algebras 
Val/ : Hale(^) ^ Hale(G). 

This homomorphism computes values of the formulas. 



8 Knowledge Bases 

We do not give here a formal definition of a knowledge base as a mathematical 
structure. Instead, we provide a kind of descriptive definition that uses the main 
ingredients of the formal one. 

Knowledge base is defined for a fixed multimodel (G,<P,F), G £ 0. The 
variety of algebras 0 is fixed. As a logical ground of knowledge bases we take 
the Halmos algebra Hale{^)- 

Take T C Hale{^){X) and A C Hale{G){X). For f £ F we set 

Tf =A= f] Valf{u) 

u€T 

A^ = T = {u\A c Valf{u)} 
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The set T is considered as knowledge description. This is a query to knowl- 
edge base. 

The set A = is a content of knowledge, i.e., the reply to a query. By 
definition, a reply to a query is an algebraic set and it is an object of the category 

The set T = is a filter of the boolean algebra Hale{^){X). The factor 
algebra by this filter is an invariant of the algebraic set A. 

9 Isomorphism and Equivalence of Knowledge Bases 

Let two multimodels (Gi,^i,Fi) and (G2,^2,T2) be given. Denote the corre- 
sponding knowledge bases by KB\ and KB2- In order to compare them, let us 
define the notion of isomorphism of knowledge bases. 

Definition 1 

Knowledge bases KB\ and KB2 are isomorphic, z/<?i = <?2 = ^ and 

1 . There is a bijection a : Fi ^ F2, 

2 . For all f € Fi there is an isomorphism of categories 

7 / : K,pe{f°‘) 

i.e., 7/ is a bijection on the objects, preserving all links, 

3 . For all T it holds 

= fGF,. 

The notion of equivalence is much weaker. First, T>i = < 1>2 is not required, 
and, second, knowledge bases meet only two conditions. 

Definition 2 

Knowledge bases KB\ and KB2 are equivalent, if 

1 . There is a bijection a \ F\ ^ F2, 

2 . For all f € Fi there is an isomorphism of categories 

7 / : K<p^e{f) -)> iL^> 2 ©(/“). 

It is proven that such a definition of equivalence exactly refiects the idea of 
informational equivalence of knowledge bases. 

10 Main Results 

Definition 3 

Multimodels (Gi,T>,Fi) and (G2,^, F2) are isomorphic if there is a bijection 
a : Fi ^ F2 such that the models (Gi,<P,f) and (G2,^, /“) are isomorphic. 

Theorem 1 

Knowledge bases KB\ and KB2 with the finite G\, G2 are isomorphic if and 
only if the corresponding multimodels are isomorphic. 

Definition 4 

Models {Gi,<Pi, fi) and {G2,’l’2i f2) are automorphic equivalent if 
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1. Algebras Gi and G 2 are isomorphic, 

2. Groups of automorphisms Aut{f\) and Aut{f 2 ) are conjugated by some iso- 
morphism (5 : G 2 — >■ Gi, i.e., Aut{f 2 ) = S~^Aut{fi)6. 

Definition 5 

Multimodels {Gi,<Pi, Fi) and {G 2 ,d> 2 , F 2 ) are automorphic equivalent if there is 
a bijection a : Fi ^ F 2 for which the models (Gi,<?i,/) and (G 2 ,'? 2 >/“) ot’e 
automorphic equivalent for all f € Fi. 

Theorem 2 

Knowledge bases KBi and KB 2 with finite G\, G 2 are equivalent if and only if 
the multimodels {G\,<Pi, Fi) and {G 2 ,d> 2 , F 2 ) are automorphic equivalent. 

These two theorems allow to build an algorithm of verification of knowledge 
bases isomorphism and equivalence. 

Proofs of these results are based on a special Galois-Krasner theory which is 
built for this particular situation. 

Example 

There are a lot of automorphic equivalent but not isomorphic boolean models, 
that certifies that isomorphism is much more strict condition for informational 
equivalence. For example, let an arbitrary boolean algebra B be given. Consider 
a ternary symbol of relation ip which is realised on B in two ways: for a,b,c G B 

{a,b,c) e fi{ip) <t4> ayb=c, 

(a,b,c) G f 2 ((p) aAb=c. 

Denote the corresponding models {B, ip, fi) and {B, p, / 2 ). They are not iso- 
morphic. Indeed, let these models be isomorphic with the isomorphism S. Then, 
from a V & = c it follows, from the one hand, S{a) V S{b) = S{c), but from the 
other hand S{c) = 6{a) A 6{b). Contradiction. 

However Aut{fi) = Aut{B) = Aut{f 2 ). Groups Aut{fi) and Aut{f 2 ) are 
conjugated by an identical automorphism of the algebra B. This means that 
these models are automorphic equivalent and the corresponding knowledge bases 
are informationally equivalent. 
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Quantum computers are the only model of computing to credibly violate the 
modified Church- Turing thesis, which states that any reasonable model of com- 
putation can be simulated by a probabilistic Turing Machine with at most poly- 
nomial factor simulation overhead. This is dramatically demonstrated by Shor’s 
polynomial time algorithms for factorization and discrete logarithms ^2] • Shor’s 
algorithm, as well as the earlier algorithm due to Simon HH can both be cast 
into the general framework of the hidden subgroup problem (see for example jOI). 
Two recent papers n study how well this this framework extends to solving 
the hidden subgroup problem for non-abelian groups (which includes the graph 
isomorphism problem). 

Indeed, there are very few superpolynomial speedups by quantum algorithms 
that do not fit into the framework of the hidden subgroup problem. One example 
is the recursive fourier sampling problem [Q , which provided early evidence about 
the power of quantum algorithms. Very recently, van Dam and Hallgren give a 
polynomial time quantum algorithm for solving the shifted Legendre symbol 
problem |3j; the algorithm does not appear to fit into the HSP framework. 

There is another class of quantum algorithms that have their roots in Grover’s 
search algorithm, which gives a quadratic speedup over brute force search. There 
is a matching lowerbound |2j , thus showing that it will be hard to find polynomial 
time quantum algorithms for NP-complete problems. 

Finally, last year an intruiging new paradigm for the design of quantum al- 
gorithms by adiabatic evolution was introduced |S|. Encouraging results from 
numerical simulations of this algorithm on small instances of NP-complete prob- 
lems appeared in the March issue of Science |0|. Very recently, a few analytic 
results about this new paradigm have been obtained P] — first, adiabatic quan- 
tum computing is truly quantum, since it gives a quadratic speedup for general 
search. Secondly, there is a simple class of combinatorial optimization problems 
on which adiabatic quantum algorithms require exponential time. 
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Abstract. The superposition (or composition) problem is a problem 
of representation of a function / by a superposition of ’’simpler” (in a 
different meanings) set Q of functions. In terms of circuits theory this 
means a possibility of computing / by a finite circuit with 1 fan-out gates 
n of functions. 

Using a discrete approximation and communication approach to this 
problem we present an explicit continuous function / from Deny class, 
that can not be represented by a superposition of a lower degree functions 
of the same class on the first level of the superposition and arbitrary 
Lipshitz functions on the rest levels. The construction of the function / 
is based on particular Pointer function g (which belongs to the uniform 
AC°) with linear one-way communication complexity. 



1 Introduction 

In complexity theory the superposition approach provides a new proof of the 
separating of monotone NC^ from monotone P [KaR aWi'] . In classic mathematic 
the problem of representation of functions by functions of ’’simpler” (in some 
sense) quality has a long history and is based on the following problem. It is 
known that a common equation aix" -I- 02 a;"“^ -!-••• + -I- Qn+i = 0 for n < 4 

can be solved over radicals. In terms of the superposition problem this means 
that the roots of the equation can be represented by a superposition of arithmetic 
operations and one variable function of the form ^/a (n = 2, 3) of coefficients 
of the equation. Galois and Abel proved that a common equation of the 5-th 
order can not be solved in radicals (can not be represented as a superposition 
of this special form) . Hilbert , formulated the 13-th problem the problem of 
representing a solution of a eommon equation of the 7-th order as a superposition 
of functions of two variables. The importance of the 13-th Hilbert problem is 
that it demonstrates one of the points of growth of function theory: it motivated 
an investigation of different aspects of the superposition problem. 
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99-01-00163 and Fund ’’Russia Universities” under the grant 04.01.52 
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Arnold nn and Kolmogorov IE2I proved that arbitrary continuous func- 
tion f{xi, . . . ,Xk) on [0, l]^can be represented as a superposition of eontinuous 
functions of one variable and sum operation: 

2fc+l / 

f{xi,...,Xk) = X] 

Note that the functions hij are chosen independently from / and as it is proved 
in the functions hij belong to Holder class. 

Vitushkin made an essential advance in the investigation of the superposition 
problem. Let tFp denote the class of all continuous functions of k variables which 
has restricted continuous partial derivatives up to the p-th order. Vitushkin (see 
a survey m) proved the following theorem 

Theorem 1. There exists a function from which can not be represented by 
a superposition of functions from if ^ 

Later Kolmogorov gave a proof of Theorem E that was based on compar- 
ing complexity characteristics (entropy of discrete approximation of functional 
spaces) of classes and Kolmogorov’s proof shows that the classic su- 
perposition problem has a complexity background. The notion of entropy of 
functional spaces that was introduced by Kolmogorov was a result of the in- 
fluence of Shannon’s ideas. Note that Kolmogorov’s and Vitushkin’ s proofs show 
only the existence of the functions of Theorem^ and do not present an example 
of a funetion from that ean not be represented by a superposition of functions 
from J-g . See the survey m and for more information on the subject. 

Further advance in presenting “constructive” continuous function which is 
not presented by certain superposition of simpler functions (“hard continuous 
function”) was made in the paper It was proved that a function fa G 

that is defined by a most hard (in terms of general circuits complexity) boolean 
function G can not be represented by a superposition of functions from if 
I > |. Remind that almost all boolean functions are hard, but an explicit 
example of hard boolean function is not known, yet. 

In this paper we generalize results of EEl where first example of explicit 
“hard continuous function” has been presented. We use a discrete approxima- 
tion of continuous functions and the communication complexity technique for 
the investigation of the superposition problem. Using certain Pointer boolean 
function g from the uniform AC^ with the linear one-way communication com- 
plexity we define an explicit continuous function that can not be represented by 
a superposition of a lower degree functions of the same class on the first level of 
the superposition and arbitrary Lipschitz functions on the rest levels. 

The proof method of this result based on the following. Having continuous 
function / we suppose that it is presented by a superposition S of some kind of 
continuous functions. We consider their proper discrete approximations df and 
DS and compare the communication complexity Cdf and Cos of df and DS 



k 

i=i 



(xj) 



( 1 . 1 ) 
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respectively. By showing Cos < Cdf we prove that / can not be presented by 
the superposition S. 

The theoretical model for the investigation of communication complexity of 
computation was introduced by Yao |Yao| . We refer to books |HrlKulNi| for more 
information on the subject. 



2 The Function 



We define explicit continuous function of k arguments on the cube [0, l]^by 
explicit boolean function g (more precisely sequence g = {gn} of explicit boolean 
functions). Informally speaking our construction of fuj^g can be described as 
follows. We partition cube [0,l]*to the infinite number of cubes (to 2^" cubes 
for each n > Uq). We encode each of 2^” cubes by binary sequences of the length 
kn. Boolean function fi^^g over kn arguments determines the behavior of fuj,g in 
each of 2^" cubes. 

Now tern to the formal definition of fuj,g- We consider n = 2-1 — 1, j > 1 
throughout the paper in order not use ceiling and floor brackets. Let In = 
be a closed interval, = I„ x ■ ■ ■ x In , and l'^ = Un>i bet 

k 

E = {0,1}. We consider the following mapping a : E* — >■ [0,1]. For a word 
u = iJi . . . (T„ we define 



a{v) = 



n + 1 



1 + CTi2 
V i=l 



2«+i 



Denote = |a(u) : v S Y"}. For a number a{v) S denote 



In{a{v)) 




1 

(n+ 1)2"+!’ 



a{v) 



1 

(n+ l)2”+i 



a closed interval of real numbers of size S{n) = • From the definitions of 

An and /„(a(u)) it holds that: 

1. For a{v), a{v') G An and a(v) ^ a{v') segments /„(a(u)), and In{a{v')) 
can intersect only by boundary. 

Ua(?;)GA„ bn(a(^^)) = In 

Let US define the function I'n,a{v){x) on the segment J„(a(u)), a{v) G An as 
follows 



[ ^ <x< a{v) 

'I'n,a{v){x) = \ 1 — — a{v)), o(u) < X < a(v) + (2.2) 

[ 0, else 

From the definition it follows that the function I'n,a(v)i^) reaches the max- 
imum value 1 in the center of the segment In{a{v)), a(v) G An and value 0 in 
the border points of this segment. 
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For a sequence v = (vi,...,Vk), where Vi G A'", 1 < z < A: , denote 
/^(6(w)) = I„(a(vi)) X ••• X In{a{vk)) a fc-dimension cube of size 6{n), where 
b{v) = {a{vi), . . .,a{vk)). 

Consider the following continuous function ^n,b(v){x) inside each cube 
I^{b{v)), V = {vi,...,Vk) G Sn, b{v) = {a{vi),...,a{vk)). 

k 

i=l 

Function ’Pn,b(v)(x) has following important properties: it reaches the maxi- 
mum value 1 in the center of the cube I^(b(v)); for all border points x of cube 
I^(b(v)) it holds that 'l'n,b(v)(x) = 0. 

Let g = {g„(v)}, where 

: r” X • ■ ■ X r" ^ {0, 1} 

k 

be the sequence of the following Pointer boolean functions. For a sequence v = 
{vi, . . . ,Vk), where Vi G if", 1 < z < fc , we will consider the following partition 
pat{n, k): each word Vi of the sequence v is divided into two parts: the beginning 
Ui and the end Wi of length l{n,k) = n — d{n,k) and d{n,k) = [(log A:n)/fc] 
respectively. We will write v = (u,w) and call u = (zzi, . . . ,Uk) the first part of 
the input sequence v and w = (wi, . . . ,Wk) the second part of the input sequence 

V. 

Function gn{u, w) = 1 iff {ord{wi . . .Wk) + l)-th hit in the word U\ . . .Uk is 
one (ord{d) denotes the integer whose binary representation is d. The numer- 
ation of bits in the words starts from 1). We will use both notation gffv) and 
gn{u,w) for the boolean function gn- 

The function g„ can be formally described by the following formula: 

kd{n,k) 

9n{u,w) = y f\ yffAXord(cr), 

<y i—1 

0<ord{<7)<\u\ — l 

where pj (xj) is the j-th symbol of the sequence w (zz) in the common numer- 
ation of its elements. Clear that g is in the uniform class E 2 {AC'^). 

Let to{S) be a continuous function such that lim5_>ow((5) = 0. Define a con- 
tinuous function f^^g on cube [0, 1]^ as follows: 

U,g{x)= Y C^dniv) -l)uj{6{n))Fr,^b{v){x), (2.3) 

n=2^-l, 

j>k 



3 The Result 

Remind definitions from functions theory. Denote C to be a class of continuous 
functions of fc > 1 variables which are defined on closed cube [0, 1]^. It is known 
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that functions from C are uniformly continuous. Following functions theory for 
each f{xi, . . . ,Xk) € C define modulus of continuous ojf{S). That is, ojf{6) is a 
least upper bound of \f{x) — f{x')\, for all x,x' £ [ 0 , 1 ]^ such that |a: — x'\ = 
maxi<j<fc \xi - x'f\ < 6. 

We use the following standard definitions. Denote 

Tduj = {/ G C : LOf{S) < Mlo{5), for some M > 0}, 

Denote an essential subset of Hu;- That is, 

Hui = {f £ C : Miuj{5) < 0Jf{5) < M2to{S), 

for some Mi, M 2 > 0. The following classes are known as Holder classes in 
functions theory: 

n^ = {f £C: cof{S) < M6^, for M > 0} (76 (0, 1])). 

The following properties are known as classic properties: 

1 . Uy dUiiii < 7 '. 

2. Well known class T G C oi continuous functions which have continuous 
derivatives is a proper subclass of "Hi. The class "Hi is known also as Lipschitz 
class. 

3. Class TLy — is a class of constant functions if 7 > 1. 

More general class of functions 

= |/ G C : w/(,5) log ^ = o| 

is known as Deny class. Deny class contains Holder classes properly. 

Let p > 1, a = l/(eP+^), and 



,, (x\- j if 0 < X < a 

' |_l/(lnl/a)P if X > a, 

Class 'Huip is a subclass of Deny class V. 

Let 17 be some set of functions. We define the superposition of functions of 
C as a function computable by a leveled circuit with a constant number of 1 
fan-out gates from the set 17. 

Theorem 2. Function faip,g{x) over fc > 4 variables belongs to the class 
and is not represented by a following superposition of functions: 

1. Superposition contains on the first level functions oft, t < k, variables from 
the class Hf, . 

LOp 

2. Superposition contains arbitrary continuous functions from Hi on the re- 
maining levels of superposition. 
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Below we present more general theorem |3 Theorem |3 is a corollary of it. 
Let A*, B® be some classes of continuous functions of t and s variables. Define 
class of continuous functions of k variables that can be represented 
by a superposition of the following form 

F hsixl, a;®)) , 

where F(yi,...,ys) is a function from class B®, and {hi{xi, . . . ,Xt) : 
1 < i < s } C A*. 

From the definition it holds that for modules of continuous i^ 2 {S) 

it holds that the function uj{S) = uj2{ix!i{6)) is a modules of continuous and 

Theorem 3. Let o;i((5) he an increasing function such that does not in- 

crease when 6 increase and 



Then for s>l, M> 0,7 s (0,1], u; 2 {S)=MS'^, uj (6) = u> 2 {iOi{S)) function 
fuj.gix) belongs to J. 

The proof of general theorem 0we present in the next section. 

Proof of theorem O First. Function ojp{S) satisfy the conditions for the 
uJi of theorem Elfor arbitrary constant c > 0 and especially for c = 1 — t < k. 
Next. Superposition of arbitrary functions from the class "Hi is again a function 
from TLi- From theorem 0 results the statement of theorem 0 LI 



4 The Proof 

The proof of the fact that f^^g G results from the following property. 
Property 1. For the function f^^g it holds that 

1. In each cube G function f^^g gets its maximum (minimum) value 

oj(S(n)) (—oj(S(n))) in the center and value zero in the border of the border 
of the cube I^(b(v)). 

2. If in addition function lo(S) is such that does not increase when S increase 
then for modules of continuous w/ of the function fi^^g it holds that 

a) w((5(n)) < u!f{S{n)) < 2koj{5{n)). 

b) for arbitrary <5 0 Jf{S) < 2kui{6). 
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Proof. The proof use standard technique from functions theory. It will be 
presented in a complete paper. □ 

The proof of the second part of the theorem 0 use communication complexity 
arguments and is based on computing communication complexity of discrete 
approximations of the function fuj,g- 

Let / be an arbitrary continuous function defined on the cube [0, 1]^. Denote 
a{n) = min{f{x) : x € ll^ = [^, and /3(n) = max{/(a;) : x G /^ = 

[1 ipi 

Definition 1. Let f{xi, . . . ,Xk) be a continuous function on the cube [0,1]^. 
Call a discrete function df : x • • • x if" — ?> [a{n), P{n)] an e{n)-approximati- 

k 

on of the function f{x \, . . . , Xk), if for arbitrary v = (vi, . . . , Vk) G If" x • • • x If" 
it holds that 

\f{b{v)) - df{v)\ < e(n). 

We will use the standard one-way communication computation for computing 
the boolean function G g. That is, two processors and obtain inputs in 
accordance with the partition pat(n, k)oi input v. The first part u = (ui , . . . , Uk) 
of the input sequence v is known to Pu and the second part w = (wi, . . . , Wk) of 
V is known to Pu,. 

The communication computation of a boolean function gn is performed in 
accordance with a one-way protocol ip as follows. sends message m (binary 
word) to Pu,. Processor P^, computes and outputs the value gn{u,w). The com- 
munication complexity of the communication protocol ip for the partition 
pat{n, fc)of an inputs v = (ui, . . . , Vk) is the length \m\ of the message m. 

The communication complexity Cg^(pat(n,k)) of a boolean function g^, is 
min{C'i/, : ip computes gn}. 

Lemma 1. For the boolean function gn & 9 it holds that 

^)) ^ “ 1) ~ logfcn. 

Proof. With the function gn{u,w) we associate a x commu- 

nication matrix CMg^ whose (u,w) entry is gn{u,w). 

Using the fact that Cg^{pat{n, k)) = [log nrow(CMg„)] , where nrow{CMg^) 
is the number of distinct rows of communication matrix CMg^ (see | |Yao| l and 
the fact that for the gn it holds that nrowiCMg^) = ^ ^ we obtain 

the statement of the lemma. □ 

We will use the same one-way communication computation for computing 
a discrete function df(v). Let pat(n,k)he a partition of input v, v = {u,w). 
Let Pu and Pu, be processors which receive inputs according to pat{n,k). Let 
(p{pat(n, k)) be a one-way communication protocol, which compute df{u, w). The 
communication complexity C,p of the <p{pat{n,k)) is the total number of bits 
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transmitted among processors Pu and Pw et The communication complexity 
Cdf{pat{n, k)) of a discrete function df we define as follows 

Cdf{pat{n,k)) = minjC^ : 4>{pat{n,k)) compute df{v)}. 



Definition 2. Define a communication complexity Cf{pat{n,k),e{n)) of an 
e{n) -approximation of the function f as follows: 

Cf{pat(n,k),e{n)) = min{Cdf{pat{n,k)) : df{v) — e{n)~ approximation of 

/}• 



Lemma 2. For e{n) < uj{5{n)) , for arbitrary e{n)-approximation df of the func- 
tion fui,g it holds that 



< Cf{pat{n,k),e{n)). 

Proof. Suppose that 

^)) > Cf{pat{n,k),e{n)). (4.5) 

This means that there exists an e(n)-approximation df of the function 
f{xi, . . . , Xk) such that for x communication matrices CMg^ and 

CMdf of functions pn and df it holds that 

nrow{CMg^) > nrow{CMdf). 

From the last inequality it follows that there exist two inputs u and u' such 
that two rows roWg^{u) and roWg^{u') are different but two rows roWdf{u) and 
rowdf(u') are equal. This means that there exists an input sequence w for which 
it holds that 



gn{u,w) ^ gn{u ,w), (4.6) 

df{u,w) = df{u',w). (4.7) 

Let gn{u,w) = 1, gn{u',w) = 0. Let us denote v = (u,w), v' = (u', rc).Then 
form the definition of the fui,g we have: 

/u;.g(&(t’)) = w(^(n)), (4.8) 

fu,g{h{v')) = -w((5(n)). (4.9) 

From the definition of the £(n)-approximation of the f^j^g and the property o 
it holds that 



\fij,g{b{v)) - df{v) \ < e{n) < Lo{S{n)), 
\U,g{bW)) - df{v’)\ < e{n) < u}{S{n)). 



(4.10) 

(4.11) 
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From our conjunction that df(v) = df{v'), from (14.8II . (14. hi . 14.1 01 . and (14.1 1 1 
we have that 



2w(5(n)) = \U,g{b{v)) - U,g{b{v')) \ < 

< \U,g{b{v)) - df{v)\ + \fc,,g{b{v')) - df{v')\ < 2u;{S{n)). 

The contradiction proofs that df{v) yf df{v'). □ 

Let dhi : If" x • • • x if" — t-Z, l<i<t,bea discrete functions and DF : 
^ ^ ^ — — 
t 

if" X • • • X if" — >■ Z, here Z denote the set of real numbers, be a following 

k 

discrete function: 



DF = F{dhi{vl, vl), dhs{vl, Vt")), 
where function F(yi, ... ,ys) is an arbitrary continuous function. 

Lemma 3. For a discrete function DF(vi, . . . , Vk)it holds that 

S 

CDF{pat{n, fc)) < ^ Cdhi {pat{n, k)) 

i=l 

Proof Communication protocol (f>*{pat{n,k)) for the function DF consists 
of processors P* and P^. Given an input u,w (f* {pat{n, k)) simulate in parallel 
protocols (pi{pat{n,k)), <f> 2 {pat{n,k)),. . . , (fs{pat{n,k)) which computes dh\, 
dh 2 , • • ■ , dhs , respectively. The processor P* on received a message from P* and 
the input w computes outputs yi,. . . ,ys of protocols 4>\{pat{n, fc)), </> 2 (pat(n, fc)), 
. . . 4>s{pat{n, k)) and then computes and outputs a value F{y), y = (yi, . . . , ys). □ 



Lemma 4. Let functions iOi , UJ 2 satisfy conditions of the theorem 0 and let 
uj{6) = oj 2 {u>i{S)). Let the function fi^^g{xi, . . . ,Xk) can he represented as a su- 
perposition of the form 

F {hi{x\,...,xl),..., h,{x{, ..., xt)) , 

where F G and {hi{xi, . . . , Xt) : 1 < i < s } C . 

Then there exists an e'{n) < uj{S{n)), such that 

Cu,^{pat{n, fc),e'(n)) = o(n). 

Proof. We will denote / our function f^^ g in the proof of the theorem. 

Let e = tui (S(n)) / log Consider arbitrary function h G {hi, . . . ,hs}. 

Let Q!(n) = min{/i(x) : x G L{, = [^, f]*}, and /3(n) = max{/i(x) : x G = 
[-,-]*}• Let 

Ln ’ nJ J 



Pe(„) = AU{P{n)}, 
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where 

A = jtti : tti = a{n) + e(n)z, z G {0, 1, ■ • ■ , } ■ 

Due to selection of the value e(zz), from the condition (15.411 of the theorem 0 
and from the equality 5{n) = have that: 

|7^e(„)| = (4.12) 

Or |7?.£(„)| < 2"*. This means that there exists an e(n)-approximator dh of 
continuous function h, 



dh : X ■ ■ ■ X r" ^ 7^e(„). 

t 

For the prove of the statement of the lemma we show that the discrete func- 
tion 

DF{vi,. ..,Vk) = F{dhi{vl, dh,{vl , .... <)) 

is the e'(n) -approximation of the function / and 

C]jF{pat{n,k)) = o(n). (4.13) 

Let V = (vi, . . . ,Vk) G A’"x • • • xi7". First we prove that for some s'(n) < uj(S) 
it holds that 

lf(b(v)) - DF(vi, . . .,Vk)\ < e'(zz). (4.14) 

Denote x = {xi, . . . ,Xk) = b{v) = (a(zzi), . . . , a(ufc)). Due to the fact that 
for each z G {1, 2, . . . , s} function dhi is e(rz)-approximation of the continuous 
function hi it holds that 



\h,{x\,...,xl) - dhi{v\,...,vl)\ < e{n). 
As function w(J) decreases when 6 decreases then we have 
\F {hi{x \, . . . , x^), hs(xf, x^)) - 
F(dhi(v^, . . .,vl), . . .,dhs{vl, . . . ,<))| < 



< ujF{s{n)) < 



Ml 



From some ng for n > ng it holds that 

e'(n) < uj(S(n)). 



(u;i (d(n))y = e'(n). 



Last inequality proves (f4. 1 4|l . 

Consider now an arbitrary discrete function dh from {dhi, . . . , dhs}. 
It is sufficient to prove that 



Cdh{pat{n,k)) = o(n) 



(4.15) 
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Then using lemma |3 the H4.1 .SB results. 

With the function dh we associate a x communication matrix 

CMj_h{n) whose (u,w) entry is dh{u,w). 

Cdh{pat{n,k),e{n)) = \ log nr ow {CM dh{n))']. (4.16) 

Clearly we have that 

nrow{CMdh{n)) < min \R.s{n) 

or 

nrow{CMdh{n)) <\R-s{n)\^ ■ (4-17) 

From the definition of the partition pat{n,k) we have that d{n,k) = . 

Using CT7I) . KWi for the equality we obtain inequality (E~T^ . □ 

Finally combining statements of lemmas ilH and n we obtain the proof of 
the theorem El 

5 Concluding Remarks 

The communication technique in this paper gives a clear information explanation 
of the statements of theorems Q and Q That is, functions h from the class TLlj^ 
which satisfies the condition (O) of the theorem 01 can be approximated by 
discrete functions dh with small communication complexity o{n) (see l|4. 1 51) 1. 
Such discrete functions dh on the first level of superposition “can mix” some 
different inputs during transformation and no functions on the remaining levels 
can reconstruct this information. 

We conclude with open problems. Whether using discrete approximation to- 
gether with communication technique is possible to present an explicit function 

(i) from TLi which could not be presented by a superposition of functions 

from if t < k; 

(ii) from which could not be represented by a superposition of functions 

from if I > I ? 

Acknowledgment. We are grateful to Marek Karpinski for helpful discussions 
on the subject of the paper. 
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Abstract. In this paper we introduce a model of a Quantum Branching 
Program (QBP) and study its computational power. We define several 
natural restrictions of a general QBP model, such as a read-once and a 
read-fc-times QBP, noting that obliviousness is inherent in a quantum 
nature of such programs. 

In particular we show that any Boolean function can be computed deter- 
ministically (exactly) by a read-once QBP in width 0(2"), contrary to 
the analogous situation for quantum finite automata. Further we display 
certain symmetric Boolean function which is computable by a read-once 
QBP with O(logn) width, which requires a width S7(n) on any deter- 
ministic read-once BP and (classical) randomized read-once BP with 
permanent transitions in each levels. 

We present a general lower bound for the width of read-once QBPs, 
showing that the upper bound for the considered symmetric function is 
almost tight. 



1 Introduction 

Richard Feynman observed in 1982 (0) that certain quantum mechanical effects 
cannot be simulated effectively on a classical computer. This observation led to 
the general idea that perhaps computation in general could be more efficient in 
some cases if it uses quantum effects. During the last decade the area of research 
and developing of such algorithms that use different theoretical quantum compu- 
tational models became an intensively growing area. Shor’s quantum algorithm 
for factoring m that runs in polynomial time is well known. 

As it is mentioned in jS] quantum computers may have two parts: a quantum 
part and a classical part with communications between these two parts. In that 
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case, the quantum part will be considerably more expensive than the classical 
part. Therefore, it will be useful to make the quantum part as simple as possible. 
This motivates the study of restricted models of quantum computations. 

During the last decade different restricted quantum computational models 
have been investigated. In particular quantum finite automata has been intro- 
duced and first investigated by Kondacs and Watrous see for more infor- 
mation on the subject. It has been shown that one-way quantum finite automata 
with bounded error cannot accept arbitrary regular languages . But Ambainis 
and Freivalds 0 presented certain regular language which can be presented by 
quantum finite automata with bounded error much (exponentially) cheaper than 
by the classical randomized finite automates. 

In the paper we introduce a model of quantum branching programs. A 
branching program is a very convenient model for implementing different re- 
stricted variants of the computational model. Leveled oblivious permutation BPs 
are well known in complexity theory, their computational power is strong enough 
(deterministic leveled oblivious permutation constant width BPs have the same 
power as logn depth circuits 0). Next, from our point of view branching pro- 
grams are very natural model for comparing computational power of quantum 
models with the deterministic ones. 

Note that from quantum point of view deterministic leveled oblivious per- 
mutation BPs model of BPs are a particular case of quantum BPs. 

In the paper we investigate the properties of the important restricted compu- 
tational variant — quantum read-once BP. We first show that read-once exact 
quantum BPs (noting that obliviousness is inherently in a quantum nature of 
such programs) can compute arbitrary Boolean function. Next we display cer- 
tain symmetric Boolean function which is computable by read-once QBP with 
O(logn) width, which requires width J7(n) on any deterministic read-once BP 
and (classical) randomized read-once BP with permanent transitions in each lev- 
els. We present a general lower bound for the width of read-once QBPs, showing 
that the upper bound for the considered symmetric function is almost tight. 



2 Preliminaries and Definitions 

Consider a d-dimensional Hilbert complex space TL‘^ with a norm ||.||. That is, 
for z G z = {zi , . . . , Zd}, ||- 2 :|| = Denote a sphere of radius 

1 of the space = {z •.\\z\\ = 1}. 

Recall some basic notations from quantum mechanics. A pure quantum state 
(or superposition) of a quantum system QS with d stable states {1, . . . , d} (d- 
state QS) can be expressed by associating an amplitude Zi (complex number) to 
each state i of QS. Quantum mechanics uses for this the following notations. Con- 
sider quantum basis states {|1), . . . , |d)} where {|f)} is the set of d-dimensional 
orthogonal basis vectors of where |i) denotes the unit vector with value 1 at 
i and 0 elsewhere. A pure quantum state or eonfiguration of QS can be specified 
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as 

d 

1 ^) = 

or just 1^) = (zi, . . . ,Zd) where |^/>) G C'^. The specific notation \tp) is so called 
Dirac notation for expressing column- vector (zi, . . . , Zd)- Element Zi of |i/') is 
called amplitude of the basis state |z) of QS, and |zjp is the probability of 
finding QS in the state i when QS is measured. 

Time evolution of configurations of QS in discrete steps is reversible and 
conveniently expressed using Heisenberg’s matrix mechanics. That is, if in a 
current step a configuration of QS is \'ip) then in the next step a configuration 
of QS would be where \ip') = U\ip) and U is d x d unitary matrix. 

Now we formally define a quantum transformation as follows. Let X = 
{xi, . . . ,Xn} be a set of Boolean variables. Define quantum transformation (d- 
dimensional quantum transformation) on li/;) G as a triple (j, C/(0), C/(l)) 
where j is the index of variable Xj G X and C/(0), C/(l) are reversible transfor- 
mations of presented by unitary d x d matrices. Quantum transformation 
(j, C/(0), [/(!)) of IV') acts as follows: C|V') = If’')- If Xj = 1 then U = C/(l) else 
[/= C/(0). 

2.1 Definition of a QBP 

A Quantum Branching Program of width d and of length I ((d, l)-QBP) based 
on QS is defined by a 

P=(T,|V'o),E) 

where T is a sequence (of the length 1) of d-dimensional quantum transformations 
of d-state QS: 

T=((ji,C/.(0),C/.(l)))Li, 

IV’(O)) is the initial configuration of P. F C d} is the set of accepting 

states. We define a computation on P for an input cr = cti, . . . , cr„ G {0, 1}” as 
follows: 

1. A computation of P starts from the superposition |V’o)- On the t-th step, 
1 < i < V of computation P transforms superposition If:) to a superposition 

IV'') = C/*(crjJ|V’)- 

2. After the Lth (last) step of quantum transformation P measures its config- 
uration IV’o-) where |V'cr) = bV(o’i,)LV_i(cri,_J . . . [/i(CTij)|V'o) • Measurement 
is presented by a diagonal zero-one projection matrix M where Mu = 1 if 
i € F and Mu = 0 if i ^ F. The probability Pacceptic) of P accepting input 
a is defined by 

Paccept{<j) = ||M|V'a)|P- 

Denote by fcQBP a QBP with the restriction that each variable x G 
{x\, . . . ,x„} occurs in the sequence T of quantum transformations of P at most 
k times. 
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2.2 Function Presentation 

— A QBP P is said to compute (compute with an unbounded error) a Boolean 
function /„ : {0,1}” — >■ {0,1} if for all a G /~^(1) the probability of P 
accepting a is greater than 1/2 and for all a G /~^(0) the probability of P 
accepting a is at most 1/2. 

— A QBP P computes /„ with bounded error if there exists an e > 0 such that 
for all a G /“^(l) the probability of P accepting a is at least 1/2 + £ and 
for all cr G /“^(O) the probability of P accepting a is at most 1/2 — £. We 
call £ the margin and say that P (1/2 + £)-computes /„. 

— We say that a QBP P exactly computes /„ if P computes /„ with the margin 
1/2 (with zero error). 

3 Computational Properties 

First we show that bounded error read-once QBPs are powerful enough to com- 
pute arbitrary Boolean function. In contrast, we notice that one-way quantum 
finite automata when accepting with bounded error can only accept a proper 
subset of regular languages [9]. See also ^ for the last results on complexity 
properties of quantum finite automata. 

Property 1. For arbitrary Boolean function /„, there exists (2",n)-lQBP that 
exact computes /„. 

Proof: The proof is evident. For example, the following (2”,n) — IQBP P is a 
QBP satisfied the proposition. All possible configurations of P are trivial. That 
is, a configuration |'0) of P contains exactly one 1 and all the rest components 
of IV') are 0. An initial configuration of P is |V'o) = (1)0,..., 0). P reads input 
variables in order Xi^X 2 , ■ ■ ■ ,Xn ■ 

In each step i, 1 < f < n , P reads input (Ji and transforms its current 
configuration jV") as follows. If (Xi = 0 then |V’) does not changed. If at = 1, then 
the 1 of the configuration |V') is “moved” to 2”“* positions to the right in the 
next configuration |V’^. 

For an input sequence a — ui, . . . denote l„ the number of position of 1 
in the final (after reading a) configuration of P. Clearly we have that Q ^ la' 
iff CT o' . 

Now determine the set of accepting states P of P as follows: if f{a) = 1, 
then qi^ G F. If f{a) = 0, then ^ F. □ 



Let the class EP-QBPconst be the class of all Boolean functions exactly 
computed by constant width and polynomial length (in the number of function 
variables) . 

Property 2. For the complexity class NC^ it is holds that 



NC^ C F.-P-QBPaonst- 
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Proof: Proof is evident since known result of Barrington pj. Having width 5 
permutation deterministic branching program P which computes Boolean func- 
tion fn it is easy to construct a (const, poly)-QBP P' which exact computes /„. □ 

Consider the following symmetric Boolean function fn,pP For an input a — 
u\, . . . ,Gn S {0, 1}" we have fn,p„(cr) = 1 iff a number of ones in a is divisible 
by Pn, where is a prime and < n/2. 

Theorem 1. The function fn,p„ can be presented by a read-once (0(logpn),n)- 
IQBP with one-sided error e > 0. 

The proof of this theorem will be presented in the section below. We have 
clearly that any deterministic OBDD for fn,p„ needs width. 

Note that fn,p,, is based on regular language Lp considered in 0. For 
a prime p language Lp over a single letter alphabet defined as follows. 
Lp = {u : |u| is divisible by p}. In it is proved that for any e > 0, there is a 
QFA with O(logp) states recognizing Lp with probability 1 — e. 

We clearly have that any finite deterministic automaton for Lp needs at least 
p states. In it was shown that constant bounded error finite probabilistic 
automata also need at least p number of states to recognize Lp. The proof of this 
lower bound use the Markov chain technique (finite probabilistic automata over 
single letter alphabet is exactly a Markov chain). But for probabilistic OBDDs 
it is not the case (probabilistic transitions on the different levels of OBDD can 
be different). Therefore we can not use directly the proof method of ^ for 
general probabilistic OBDD case. But in a particular (known enough) case when 
the transitions (for 0 and 1) in each level are the same we can use Markov 
chain technique for proving linear (in pn) width for presentation /n,p„ in such 
probabilistic OBDDs. 



3.1 Proof of the Theorem ^ 

We construct a QBP P accepting inputs a G /,7,p„(l) with probability 1 and 
rejecting inputs a G /,j)p„(0) with probability at least 1/8. Consider (2,n)-lQBP 
P^ for fc G {1, ■ . ■ ,Pn — !}■ Quantum program P'^ = {T^ , \ based on 2- 

state quantum system. Here = ((z, 17^(0), C/^(1)))JL^ where 



c/"(0) = (JJ),C'=(1) 



/ cos(27rfc/p„) - sin(27rfe/p„) \ , p. 

ysm{2nk/pn) cos(27rfc/p„) y dro/ 




{ 11 - 



Denote by l{a) a number of 1-s in the sequence a, l{a) — 

Lemma 1 After reading an input tr = cti, . . . , cr„ the superposition of P^ 

is 



\'lp) = cos 



27t l(a)k 
Pn 



| 1 ) 



- sm 



27t l(a)k 
Pn 



| 2 ). 



Proof: The proof is omitted. It is similar to that of in g]. 



□ 
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If the number of ones in an input a is divisible by p„ then 2tt l[a)k/pn is 
a multiple of 2tt, cos(2tt I (a) k/pn^ = 1, sin^27r Z(cr)fc/p„^ = 0 . Therefore all 
QBP accept inputs a G fn,p„(^) ^ith probability 1. 

Following ^ call P^ ’’good” for input a G fnp„(0) if reject a with 
probability at least 1/2. 

Lemma 2. For any a G /,/p„(0), at least {pn — l)/2 of all P^ are “good”. 

Proof: According to the Lemma Q after reading an input a = (Ti , . . . , cr„ the 
superposition of P^ is 



I')/) = cos 



27rl{a)k 



Pn 



|1) + sin 




Therefore, the probability of accepting the input cr G fn,p„(0) is 
cos^ {2 tt l{cr)k / pn^ . cos'^ ^2n I (a) k/p„'^ < 1/2 iff cos ^2n I (a) k/pn^ < l/\/2. 
This happens if and only if (^2tt l(a)k/pn^ is in [tt/ 4 + 27rj, 37 t/ 4 + 27rj] or in 
[57r/4+27rj, 77r/4 + 27Tj] for some j G N. l{a)k / pr^ G [7r/4 + 27rj, 37r/4+27r/] 

iff (^2tt (I (a) k mod pn)/pn'^ G [tt/4, 37t/4]. p„ is prime, Z(cr) is relatively prime 
with Pn- Therefore, ^(CT)modp„, 2Z(cr) mod Pn, . . . , {pn — l)Z(cr)modp„ is 
1,2, .. . ,pn — 1 in different order. Consequently, it is enough to find the power 
of a set / = {ii, . . . ,q} C {1, ■ ■ ■ ,Pn — 1} such that 2Trij fpn G [7r/4,37r/4] or 
2TTij /pn G [57t/4, 77t/4]. a straightforward counting show that | J| > (p„ — l)/2. 

□ 



Following 0 call a set of quantum programs S = {P*L . . . ,P**} “good” for 
cr G fn^p^(O) if at least 1/4 of all its elements are “good” for this a. 

Lemma 3. There is a set S of IQBPs with = t = |"81np„] which is “good” 
for all inputs a G /,/p„(0). 

Proof: The proof is omitted. It is similar to that of in g]. □ 

We construct a IQBP P accepting inputs a G /,/p„(l) with probability 1 
and rejecting inputs a G /r/p„(0) with probability at least 1/8 as follows. The 
IQBP P consists of ’’good” IQBPs , ■ • ■ , P** }, which work in parallel. In the 
starting superposition of P all these programs represented with equal amplitudes. 

Inputs a G /r/p„(l) are always accepted with probability 1 because all P^s 
accept them. For any input a G /,/p„(0), at least 1/4 of all P^ G S reject it with 
probability at least 1/2 and the total probability of rejecting any a G fn^l^iO) is 
at least 1/8. 

Using the technique of the paper ^ it is possible to increase the probability 
of correct answer to (1 — e) for an arbitrary e > 0. In this case the width of 
IQBP will be 0(logp„). □ 
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4 Lower Bounds 

Below we present a general lower bound on the width of IQBP and compare it 
with the width of deterministic OBDD presenting the same Boolean function. 

Theorem 2. Let e G (0,1/2). Let fn be a Boolean function {1/2 + s)- computed 
(computed with margin e) by IQBP Q. Then it is holds that 



width{Q) = L2 



/ logwidth(P) \ 
\loglog width{P) J 



where P is a deterministic OBDD of minimal width computing fn- 

Next theorem presents more precise lower bound for a particular margin e of 
computation. This lower bound show that the result of the Theorem ^ is tight 
enough 

Theorem 3. LeteG {3/8, 1/2). Let fn be a Boolean function {1/2+ e)- computed 
(computed with margin e) by IQBP Q. Then it is holds that 



width{Q) = f2 



( log width{P) \ 
V21og(l + l/r)j 



where P is a deterministic OBDD of minimal width computing fn and 
T= 1^1 + 2e- 4^1/2 -£. 

Proofs of theorems are presented in the section below. 



4.1 Proofs of Theorems |2] and 

Proofs of theorems 00 use the same idea. We construct a deterministic OBDD 
P that presents the same function /„ and 



/ 2^\2width{Q) 

width{P) < ■ (4-1) 

Proofs of theorems 0 0 differ only in estimating parameter 0 > 0 depending 
on e. 



A Deterministic OBDD-Presentation of a IQBP. Let d = width{Q). 
Let 7T = {i\,i 2 , ■ ■ ■ ,in} be an ordering of variables testing of Q. From now we 
consider that input sequences cr G {0, 1}" are ordered in the order tt determined 
by Q. We define a deterministic OBDD LQ based on Q as follows. LQ use the 
ordering tt of variables testing and presented by the following labeled complete 
binary tree. 
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— The initial node of LQ is marked by intial configuration Itpo) of Q. Two 
outgoing vertices of the initial node are marked by = 1 and Xi^ = 0. 

— Two nodes of LQ on the level 1 are marked by configurations |t/’i(0)) and 
|'i/'i(l)) of Q where \tpi{ai)) is the configuration after the first step of com- 
putation after reading Xi-^ = ai for cti € {0, 1}. 

Vertex Xi^ = cti leads from the node \tpo) to the node \tpi{ai)) iff \tpi{ai)) = 
Ui{ai)\'tpo)- 

— Consider a level j of LQ. 2^ nodes of IQ of the level j are marked by 

configurations ■ ■ ■ (Jj)) £ : ai . . . aj £ {0, 1}^} where |V’i(o’i • • • <^j)) 

is a configuration of Q after reading the first part a\ . . . aj of input a G 
{0,1}”. 

Vertex (market by = '^j+i) from the node \'ipj{ai . . . aQ) 

leads to the node iff = 

— Consider the last level n of LQ. We mark 2" nodes of IQ on the level n by 

configurations |f/'ra(o’i . . . cr„)) G 'L : (cti . . . ct„) G (0, 1}" and in addition we 
mark them by 0 and 1 as follows. We mark node . . . cr„)) by 1 if for 

configuration \'ipn{o'i . . . a„)) it is holds that Paccept{o'i . . . aQ > 1/2 We 
mark node lipriQi . . . ct„)) by 0 if for configuration |V’n(o’i . . . cr„)) it is holds 

that PaccepticTi . . . (J„) < 1/2 - £. 

Property 3. A deterministic OBDD LQ presents the same Boolean function /„ 
as Q. 

Proof: Evident and follows from the construction of LQ. □ 



A metric Automaton Characterization of LQ. We view on OBDD LQ with 
the ordering tt of variables testing as a following metric time- variant automaton 
that reads its input sequences cr G (0, 1}" in the order tt: 

LQ =({0,1}, S', {<5, }”=i, I 

where {0, 1} is the input alphabet, = {IV')} is a set of states (set of all possible 
configurations of Q during its computations on inputs from {0,1}"). That is, 
= U/VgiL, where tf/,- is the set of states of LQ on the level j. Automaton 
transition function 5j : 'Lj-i x {0, 1} — >■ I'j determines transitions on the step j, 
^ ^ j ^ n, {6j is defined in according of transitions of LQ on the level j — 1). 
Finally |V’o) is the initial state and L/ = {|V’) G ■ ||M|V’)|P > 1/2 -|-£} is the 
accepting set of states of LQ. 

For J G {1, . . . , n} denote Aj : I'j-i x {0, l}"~l+i — >■ automaton transitive 

closer of the sequence 6j, . . . ,Sn of transition functions. That is, 



Aj{\ip),aj . . .an) = 6n{...{Sj{\ip),aj),...,an). 
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Lemma 4. Let fn be a Boolean function (^1/2 + e)-computed by LQ. Let 9 > Q 
and for arbitrary \ijj) G iFe and arbitrary \tpY € H holds that 

\m-w)\\>9. 

Then, there exists a deterministic OBDD B which computes fn and 



Proof: Recall known notions of metric spaces we need in the proof (see for 
example 0). A Hilbert space Tld is a metric space with a metric defined by 
the norm || • ||. Points fx, fi' from 'Hd are connected through 0-chain if there 
exists a finite set of points /ii , /i2 , . • . , from TLd such that pci = /i, pim = 
and \\pLi — pii+i\\ < 0 for * G {1, . . . , m — 1}. For Tld its subset C is called 0- 
component if arbitrary two points p,, p! G C are connected through 0-chain. It 
is known |3| that if is a finite diameter subset of a subspace of TLd (diameter 
of V is defined as sup^_^/gi5{||/r — /r'||} then for 0 > 0 2? is partitioned to a 
finite number t of its 0-components. Set of states of LQ belongs to sphere of 
radius 1 which has center (0, 0 , ... , 0) in TLd because for all lif) gT' \i holds that 
II IV’) II = 1- For each j G {0 , . . . , n} Denote by fLj] = {C \, . . . , Ctj} the set of 
0-components of Tj C TLd- 

From the condition of the lemma it follows that subset of is a union 
of some 0-components of Transition functions Sj, 1 < j < n, preserves the 
distance. That is, for arbitrary |V’) and |V) from Tj and arbitrary 7 G {0, 1} it 
holds that 



width{B) < ( 1 



III^)-|?)II = II^.(IV’),7)-<5,(I0,7)II- (4.2) 

From ( 14.211 it holds that for C G \^j\ and for 7 G {0, 1} there exists C G [iFj+i] 
such that 5j{C,^) = C . Here is defined as 7)- 

Now we describe deterministic OBDD B in terms of time-variant finite au- 
tomaton that computes /„. 

B=({ 0,1}, I'F], Co, F) 

where [F] = U"^Q[Fj] is a set of states of B (remind that [Fj] is a set of states 
on the step j of computation of B)\ 

Sj : [Fj_i] X {0,1} — >■ [Tj] is a transition function of B on the step j; 
an initial state Co = IIV'o)} is a one-element 0-component of Fq; finite set F of 
B defined as follows F = {Ci G [F„] : Ci C Fg}. 

From the construction of B we have that OBDD B and LQ compute the 
same function fn- The width width(B) of B is t = max{fo, ■ • ■ , Q}- 

Let t = tj We estimate the number t of 0-components (number of states of 
B) of Fj as follows. For each 0-component C select one point IV") G C. If we draw 
a sphere of the radius 0/2 with the center |V’) G C then all such spheres do not 
intersect pairwise. All the t these spheres are in large sphere of radius 1 -I- 0/2 
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which has center (0, 0, . . . , 0). The volume of a sphere of a radius r in T~Ld is 
where the constant c depends on the metric of TLd- Note that for estimating the 
volume of the sphere we should take in account that Tid is d-dimensional complex 
space and each complex point is a 2-dimensional point. So it holds that 



width{B) < 



c(l-b6>/2)^‘^ 




□ 



Below we present a technical lemma that estimates the number of components 
of for different e. 

Lemma 5. Let an LQ (l/2 + e)-computes function fn- Then for arbitrary \if) € 
Te and arbitrary \if') ^ it holds that 

1. II If/j) — I'i/'O II > ^*1 = sj'/d and 

2. II IV') - m \\>02 = ^1 + 2£-4v/I7^. 

Proof: For the simplification we denote a configuration \ f;) = Z\\l) -\ \- Zd\d) 

just hy if = {z\, , Zd). Let if = (zi, . . . , Zd) and if' = {z'^, . . . z'j). Consider a 
norm ||.||i defined as \\if\\i = J2i=i \^i\- 

1. From the definition of LQ it holds that 

2s < ^ (|z.P - k'P) = ^ (|z.| - \z'Mh\ + l^'l) < 

Si&F Si&F 

<2^ (|z,|-|z'|)<2^ |z,-z'|<2||V^-V>'||i 

SiGF SiGF 

Using an inequality 

-|- 02^2 + ■■■ + cidbd ^ ^ a\ af^ a^^J b\ b\ b'^^ (4-3) 

for 6i = &2 = ■ • ■ = = 1 we get that ||V’||i < v^llV’ll- Therefore, 

2e < 2||^ — if'Wi < 2'/d\\if — if'W 

Finally, we have 

H'0 — V’ll > e/'/d. 

2. Consider now the next variant of a lower bound for 1 1'!/) — ■i/;'| |. 

d 

11^ -^'11 = \ > 

\ i=l 



> 



> 
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From the definition of LQ we have that — 1/2 + e, J2si^F — 

1/2 + e. Now from the above we get that 



W^-i^'W > l/2 + e+l/2 + e-2 ^ \zi\\z[\-2 ^ |zi||z'|. 

V SiGF Si^F 



Using inequality we get from the above that 



11 ^ - i^'W > 



\ 



1+2S-2 / i: iF\z \4 p-\e i=.t,/E i=.'t- 

V SiGF V SiGF V Si^F V Si^F 



Using the property < 1/2-&, Es^gf < 1/2-^, kiP < 1> 

and Es — 1> finally get that 

\\^ - ^'11 > \/l + 2e- 4^1/2 -e = 02 - 

□ 



Note that the lower bound above for ||'(/) — '0'|| is nontrivial (posi- 
tive) if e £ (a, 1/2) where a is about 3/8. For e G (0,a] it holds that 
1 -I- 2e — 41^1/2 — e < 0. In this case the lower bound ||'i/' — > ej'/d is more 

precise. 

Now we turn to formal estimation of the lower bounds of Theorems 0 and 0 
Proof of Theorem 13 From Lemma 0 and Lemma 0 it follows that 



t < 1 + 



2y/d' 



2d 



or logt = O(dlogd). From that we get that 

V log log t 



Proof of Theorem m From lemma Eland lemma 13 it follows that 

2d 

or 2d > logt/ log(l -1-2/02). From this we have that 

logt 



d> 



21og(l-f 1/02)' 
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Abstract. We present an implementation that turns out to be most 
efficient in practice to compute singular moduli within a fixed floating 
point precision. First, we show how to efficiently determine the Fourier 
coefficients of the modular function j and related functions 72 , f 2 , and rj. 
Comparing several alternative methods for computing singular moduli, 
we show that in practice the computation via the 77 -function turns out 
to be the most efficient one. An important application with respect to 
cryptography is that we can speed up the generation of cryptographically 
strong elliptic curves using the Complex Multiplication Approach. 

Keywords: class group, complex multiplication, cryptography, elliptic 
curve, Fourier series, modular function, ring class polynomial, singular 
modulus 



1 Introduction 

Modular functions are studied for a long time by the number theory community. 
The most famous modular function is the j-function, which is holomorphic on 
the upper half plane f) and which has a single pole at infinity. However, given an 
element t G i) the computation of the value j(r) is in general a challenging task. 
In this paper we show how to efficiently compute j(r) in practice within a given 
floating point precision F. To be more precise, we compute a complex number 
jr such that \jr — j(t)| < 10“^, and we equate j(r) and jr- 

We compare several alternative methods to compute j(r) using the cube root 
72 of j, the Weber function f2, and Dedekind’s 77- function, respectively. To be 
able to compare the running times of alternative approaches for computing j(r) 
we first have to determine the Fourier series of these four functions. For the 
determination of the Fourier coefficients of j and 72 we make use of efficient 
algorithms due to Mahler [HiEZni. Furthermore, in the case of f2 and rj we 
develop efficient formulae in Sect. 0 

Efficient computation of j(r), 72(7"), and ^2{t) has an important applica- 
tion in cryptography: The generation of cryptographically strong elliptic curves 
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via Complex Multiplication (CM) 1 |A MHMj . [IBBOOp . The general CM- 

method requires the computation of a ring class polynomial R, which is in general 
the most time consuming step of the CM-method. Hence speeding up the com- 
putation of R yields a significant speed up of the CM-method. In Sect. El we 
present the relation between efficient computation of R and fast computation of 
singular moduli j(r). Thus our approach of efficiently computing singular moduli 
yields a significant speed up of generating elliptic curves using the CM-method. 
Furthermore, our most efficient approach of computing singular moduli has the 
advantage that no precomputation or no storage of any coefficients is needed. 

We tested our algorithms on two platforms: First a SUN UltraSPARC-IIi 
running Solaris 2.6 at 333 MHz and having 512 MByte main memory. Second 
a Pentium HI running Linux 2.2.14 at 850 MHz and having 128 MByte main 
memory. All algorithms are implemented in C-|— I- using the library LiDIA 2.0 
( [LIDIA] ) with libi as underlying multiprecision package and the GNU compiler 
2.95.2 using the optimization flag 02. Sample tests indicate that running times 
on the Pentium are about a quarter of the timings on the SUN. 

The paper is organized as follows: We first review the connection between 
modular functions and the CM-method in Sect. El Furthermore, we define the 
functions mentioned above. Then Sect.Elpresents efficient algorithms to compute 
the Fourier series of these functions. Finally, in Sect. 0 we compare different 
methods to compute singular moduli. 



2 The Ring Class Polynomial and Singular Moduli 

Let us review some basic facts on class groups and ring class polynomials. Ring 
class polynomials play a crucial role in generating elliptic curves for crypto- 
graphic purposes using the CM-method. A ring class polynomial R only de- 
pends on an imaginary quadratic discriminant A, that is we have Zi < 0, 
Z\ = 0, 1 mod 4. Each imaginary quadratic discriminant A uniquely determines a 
set C{A) = {{a,b,c) \ c= ^-^^,gcd(a, 6 , c) = 1, | 6 | < a < c, & > 0 if | 6 | = 

a or a = c}. C(Z\) is called the class group of discriminant A. The class group 
is a finite set whose cardinality h(A) is the class number. Furthermore we set 
hp{A) = #{(a, b, c) G C{A) : 6 > 0}. An element Q of the class group is called 
a reduced representative. 

With each reduced representative Q = (a,b,c) of the class group C{A) we 

associate an imaginary quadratic number tq : We set tq = with the 

imaginary unit i. Obviously we have tq G f) for all Q G C{A). 

We next define the Dedekind ry- function, the Weber function ( 2 , the function 
72 , and finally the modular function j. For r G f) we set q = . Then we have 

00 

ry(r) = g«-n(l-9”)> ( 2 - 1 ) 

n—1 
00 

h{r) = V2q^^ n(l + 9”) ’ 

n—1 



( 2 . 2 ) 
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72 (r) 



f2(r)24 + 16 

f2(r)8 



j(r) = 72 (t)^ . 



(2.3) 

(2.4) 



We fix the discriminant A and set K = (Q(-\/^). The First Main Theorem 
of Complex Multiplication ( ICox89l . Theorem 11.1) states that all values jirq) 
are conjugated over K and hence have the same minimal polynomial over K : 
The ring class polynomial R. Thus we can write R = Y\QeC{A)i^ j(4'q))- It 
turns out that R is an irreducible polynomial in 'Z[X]. The values j(tq) are 
called singular moduli. Hence efficient computation of the singular moduli yields 
a fast algorithm to compute R. In Sect. o we will estimate the range of the 
coefficients of i?; this leads to a formula for the floating point precision F to 
compute R in practice. 



3 Efficient Computation of Fourier Series 

In this section we discuss algorithms to compute the Fourier series of the func- 
tions j, 72 , r], and ( 2 , respectively. These series will be used in Sect. 0] 



3.1 The Fourier Coefficients of the ji-Function 

We determine the Fourier series of the modular function j. As mentioned above 
the j-function is defined on the upper half plane f). As j is holomorphic on f) 
and invariant under the action of SL(2,21) it is periodic with period 1. Thus 
there is a unique Fourier series with j(r) = X)^-oo Cn • Furthermore, as j 
has a single pole at infinity we have c„ = 0 for n < —2 (IDEHDI, Theorem 11.8, 
p.225). 

To compute the coefficients c„ we make use of recursive formulae due to 
K. Mahler. Setting 0 for an empty sum we have for all n G IN ( [IMa,h7fij . p.91, 
equations 46): 



n— 1 



C4n — ^2n+l H" 



C4n+1 — ^2n+3 ^2^271 H” 



H” ^ ^ CfcC2n— fc ; 
k^l 

^n+1 Cn+1 H“ C2rj 



2 2 
n 2n— 1 n— 1 

+ ^ CkC2n-k+2 - ( 1) ^k^4:n—k E ^k^An—Ak j 



k^l 



k^l 



k^l 



C4n+2 — C2n+2 ^ ^ CfcC2n — fc+l ; 






C4n+3 — C2n+4 ~ C2C2n+l 



^2n+l C2n+1 



n+1 2n n 

^ ^ C/i;C2n — fc+3 ^(-1)'" ^CfeC4„ -k+2 + ^ CkC4n-4k+2 

k—1 k—1 k—1 



(3.5) 

(3.6) 



(3.7) 

(3.8) 
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However, to make use of the formulae (E3) - (EBI), we have to know the 
coefficients c_i, cq, ci, C2, C3, and C5 : Obviously, for n = 1 the equations depend 
on Cl, . . . C3. However, (EU yields C5 = C5 in this case, and we have to know C5 
before using Mahler’s equations. 

Thus we first determine these coefficients by evaluating the representation of 
j by the normalized Eisenstein series E 4 and Eq of weight 4 and 6, respectively: 
j{T) = 1728 • definition of the Eisenstein series we refer to 

| IKohaS| . The Fourier coefficients of the Eisenstein series can easily be computed. 
We refer to for details. Finally making use of Mahler’s equations 

- (|2S) we compute the values of c„ up to n = 50000 and store them in a 
file of size 38.7 MByte. The running time on the Pentium HI was 9.87 hours. 
We have c_i = 1, cq = 744, ci = 196884, ca = 21493760, C3 = 864299970, 
C4 = 20245856256, and C5 = 333202640600. 

We are now able to estimate the range of the coefficients of the ring class 
polynomial R for a discriminant Z\. As in Sect. |3 set q = with Q = (a,b,c). 

We first remark that for (a, b, c) and (a, —b, c) the respective values of q are 
conjugate complex numbers. Hence if (a, 6, c) and (a, —b, c) are both in C{A) the 
Fourier series of j shows that we have j{T[a,-b,c)) = j{^a,b,c))- Thus computing 
j{T(a,b,c)) yields j{T(^a-b,c)) for free in this case. We can therefore restrict to 

reduced representatives (a, b, c) with 6 > 0. Furthermore, we have |g| = e s . 
Hence, for fixed A, |q| only depends on a, and using the Fourier series of j we 

, , ir-v/lW „ _ 

get |j(tq)| Ri |-| = e 5 . Thus the constant term of R is up to sign of order 

of magnitude “. In most cases the constant term of R is up 

to sign the biggest coefficient. Hence the fioating point precision 



F = 5 + 



4 



log 10 



E 

{a,b,c)^C{A) 



1 

a 



(3.9) 



as proposed in [I ,/t)4) is in practice sufficient to compute R. The correction term 
5 + takes into account both the approximation |j(rQ)| ~ |^| and the fact 
that the constant term of R may not be up to sign the biggest coefficient. We 
computed dozens of polynomials R, and we are not aware of any counterexample 
where F does not yield a right result. 

We remark that there is another efficient method to compute the Fourier 
coefficients of j due to M. Kaneko f [IKa.nj V who extends work of D. Zagier. We 
refer to his paper for details. 



3.2 Computing the Fourier Coefficients of 72 

For r G t) one can easily derive the formula 72 (t) = q~^ ■ ■ As in the 

case of the modular function j, we present recursive formulae due to Mahler to 
compute the Fourier coefficients 

However, Mahler takes a slightly different representation: He multiplies the 
term q~3 into the sum and writes 72 (t) = + Yl^=obzn+ 2 q^~ ■ Hence we 
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have go = 1 and g„ = b^n-i for n > 1. If we set 0 for an empty sum we have for 
all n S INo l |Mah7ti| . p.ll5, equations 79): 

n— 1 

^ 12 n +2 = ^ 6 n +2 + ^3fc+2^6n-3fc-l 5 (3.10) 

h h I. L I ^3n+2 ^3n+2 , ^6n+2 ^6n+2 /'o 1 1 \ 

Ol2n+5 — 06n+5 ~ ^206n+2 + ^ (3.11) 

n—1 2n—l 

+ ^ ^ ^3fe+2^6n-3fc+2 ~ ^^3fc+2^12n-3fc+2 

k^O k^O 

n—1 

+ ^3fc+2^12n-12A;-4 , 



I k U I ^3^+2 ~ ^3n+2 

Ol2n+8 — ^6n+5 + 03fc+206n-3fc+2 H ^ , 

T 2 t ^ 

7 7 77 ^ 6 n +5 ^ 6 n +5 \ '' 7 7 

Ol2n+ll — 06n+8 ~ ^2^Qn-\-b ~ 03fe+206n-3fe+5 






2n 



— 1 )^ ^&3fe+2^12n -3fc+8 + ^3fc+2^12n- 



12/C+2 









(3.12) 

(3.13) 



Evaluating these equations for n = 0 we have t »2 = & 2 , = ^ 5 , = ^5 + 

( 6 | - 62)72 and 6 n = 63 - 6265 - (65 - 6s)/2 + 6265 + 6263 + 6 |. Hence 62 
and 65 uniquely determine all coefficients 6 „. The defining equation of the g„ 
yields j(r) = 7 |(r) = q~^ ■ (1 + 3gig + 3(g? + 52 ) 9 ^ + 0{q^)). Thus comparing 
with the Fourier series of the j-function we get gi = cq/3 and 92 = ci/3 — 9 J, 
hence 91 = 248 and 92 = 4124. Using g„ = 63 n_i for n > 1, we computed the 
coefficients g„ up to n = 50000. The memory to store these coefficients is 22.3 
MByte. The computation took us 5.23 hours on the Pentium III. 



3.3 Computing the Fourier Coefficients of the ry-Punction 

We next determine formulae to compute the Fourier coefficients of the 9 - function. 
We first make use of a result due to Euler, which states 

^(t)=9^ (3-14) 



Multiplying 924 into the sum and using some properties of the Jacobi symbol 

{-), we get 9 (r) = (^) 9 ^- 

Equation ([i.1411 shows that we can define a sequence (ere)„g]No by r;(r) = 
924 e„ 9 ". We prove the following result: 
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Proposition 1. Let e„ be as defined above. Then we have for all n G INq 






0 , 

( "l 

V V24n+1 J ’ 



if 24n + 1 is not a square in 'Zi, 
if 24n + 1 is o square in Z. 



Proof: The representation of the 77 -function by the Jacobi symbol and the def- 
inition of the e„ yield (^) ■ Comparing the coeffi- 
cients, we get e„ = ^ ^ 24 ^+ 1 ) i if = 24 ti J- 1, and e„ = 0 otherwise. □ 

Proposition n] shows that we have G {—1, 0, 1}. Furthermore, as the expo- 
nents in (EH grow quadratically coefficients 0 are rather sparse. Making 
use of proposition Ewe computed e„ up to ?r = 5000000, which took us 16.3 sec- 
onds on the Pentium III. The memory to store these coefficients is 9.54 MByte. 



3.4 Computing the Fourier Coefficients of the Weber Function f 2 

The main task in computing the coefficients of the Fourier series of f 2 is to 
determine the representation of the infinite product in (E3 by a power series in 
q. Obviously this product can uniquely be written as a Fourier series fnQ"- 

Making use of the equations ( F. 1 I ) and one easily sees f 2 (r) = \/2 ■ In 

Sect. 13.31 we derived the formula ? 7 (t) = q^ ry-function does 

not vanish on () we use the Fourier series of f 2 and 77 to get 

00 

n— 0 

Hence, making use of eo = /o = 1, equation yields for tt, € IN 

IkSn-k, if 2tn, 

ef - Xfe=d fkBn-k, if 2 I n. 

This yields an algorithm for computing the coefficients /„. According to our 
experience this is the most efficient way in practice to compute the Fourier 
coefficients of f 2 . As in the case of the j- and the 72 -function we determined all 
coefficients /„ up to tt = 50000, which took us 22 minutes on the Pentium III. 
The necessary amount of storage is 5.48 MByte. 




n—O 



00 



, e„g = 



n—0 



n—0 \k—0 



fk^n—k I ■ Q 



(3.15) 



4 Efficient Computation of Singular Moduli 

This section deals with comparing alternative methods to compute singular mod- 
uli. First let an imaginary quadratic discriminant A and a reduced representative 
Q G C(A) be given. Furthermore, fix a floating point precision F. We show that 
in practice the computation of J(tq) within precision F using an efficient repre- 
sentation of the 77 -function is by far the most efficient alternative. Furthermore, 
we present running times for all alternatives we make use of. We conclude that 
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the computation of a ring class polynomial of degree 200, which may be explored 
for cryptographic purposes, takes about 5 minutes on the SUN. 

The formulae and relations of Sect.Elmake the following proceeding plausible: 
As the functions j, 72, and f2 can be expressed in terms of Dedekind’s ry-function, 
we first compare three different approaches to determine the value Next 

we explore alternatives of computing f2(TQ) by its product formula ii'Z.'zii . its 
Fourier series from Sect. El and finally by the term f2(rQ) = in the 

last expression we make use of our results on the ry-function. Similarly we deal 
with 72 and j. 



4.1 Efficient Computation of the ry-Punction 

Section ro yields three different methods for computing ?y(r) : Making use 
of the product formula ■ i(i - 9"), evaluating the Fourier series 
gsi using the Euler sum (ft. 1411 . These tasks are performed by 

our algorithms computeEtaViaProduct, computeEtaViaFourierSeries, and 
computeEtaViaEulerSum, respectively. In this section we show that using 
computeEtaViaEulerSum turns out to be in practice the most efficient way for 
computing 7y(r) for a given floating point precision F. 

Input of all three algorithms is a complex number r in the upper half 
plane () and a floating point precision F that we use in our computations. 
Each algorithm returns the value ry(r) within the precision F. Both the first 
and second algorithm are straightforward. Hence we only explain our algorithm 
computeEtaViaEulerSmn(T, E). We have 



9 "“ XI 




00 




3 71 ^ 71 

We split the sum in two partial sums and set iV_(n_) = — = 2 — ^ respectively 

A^+(n_|_) = — ± 2 — Hence N_ respectively IV+ are the exponents of each partial 
sum. The difference of two exponents is A^_(n_ + 1) — N_(n_) = 3n_ + 1 
and N+{n+ + 1) — N+{n+) = 3n+ + 2, respectively. Depending on whether the 
current value of N- respectively N+ is the minimum of {A^_,iV_|_} we compute 
the appropriate power of q and adapt the value of N- respectively . It is easy 
to see that we always have N_ ^ fV+. 

Running times on the SUN of our three algorithms for all reduced repre- 
sentatives of discriminant Z\ = —21311 are plotted in the appendix. A is the 
maximal discriminant of class number 200. Further timings for discriminants 
of class numbers up to 500 can be found in [fia.iOl j . The results indicate that 
computeEtaViaEulerSmn(TQ, E) is the most efficient of our algorithms to com- 
pute r]{TQ) for a high precision E, i.e. E > 2000. We remark that the runtime 
benefit of computeEtaViaEulerSum becomes more definitely for growing o, hence 
with growing q, which is the important case for computing ring class polynomials. 
However, we tested our algorithms for low precisions and got similar results. 
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computeEtaViaEulerSuin(r, F) 

Input: A complex number r € f). 

A natural number F serving as floating point precision. 
Output: The value 7?(r) within the floating point precision F. 



Ti— ^ n.-|_ ^ 

N- ■4— 5; N+ 4— 7; // to store the exponents of q in the partial sums 
/ 4— 2; //to store the previous used exponent of q 
q 4— q^ q^\ j j q-n stores the values q" 

r; 4— 1 — q — // q stores the current approximation of q{T)/q^ 

while true do 

s 4— 1; / / s stores the sign 
if N- < N+ then 

N —I 

Qn ^ Qn ' Q 1 

if 2 I n_ then 

s 4 1; 

end if 

I ^ N--, 

N- 4— N- + 3n_ + 1; n_ 4— n_ + 1; 
else 

4- • q + ; 

if 2 I n+ then 

s 4 1; 

end if 

l^N+-, 

4 — A^+ “t“ 3n_|_ -f 2; Tl-^- 4 — U-}- -t- Ij 
end if 
4- ?7 + s • qn; 
if |qn| < 10“^ then 
return( q53 • q ); 
end if 
end while 



4.2 Efficient Computation of the Weber Function f 2 

We now turn to the Weber function f 2 . Our three approaches to compute U{tq) 
are already mentioned in the beginning of Sect. 0and are similar to the case 
of the previous section. The task of determining the value ^ 2 {tq) is performed 
by our three algorithms computeF2ViaProduct, computeF2ViaFourierSeries, 
and computeF2ViaEta. We will see that using computeF2ViaEta is in practice 
the most efficient approach. 

Input of all three algorithms is the same as in the case of the ry-function. 
Each algorithm returns the value f 2 (T) within the precision F. The algo- 
rithms computeF2ViaProduct and computeF2ViaFourierSeries are straight- 
forward. However, computeF2ViaEta makes use of the equation f 2 ('rg) = 
main observation is that we can compute numerator and 
denominator of this expression simultaneously; this is an obvious consequence 
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of ci{2t) = . Hence we get our algorithm computeF2ViaEta(r, i^) from 

computeEtaViaEulerSmn(T, F) if we change the following lines {rj 2 stores the 
current approximation of ry(2r)/gi2): 



computeEtaViaEulerSum 


computeF2ViaEta 


rj^l-q-qn, 


rj^l-q-qn, 772 ^ 1 - 


rj ^ rj + s- qn, 


r] ^ r] + s- qn, m ^ m + s ■ qi 


return( 524 ■ rj ); 


return( \/2q^ ’ ^ 



Obviously computeF2ViaEta returns the value f2(T) within the precision F. 
We compare the running times of all three algorithms for the reduced repre- 
sentatives of C(— 21311) in the appendix. Further practical results are listed in 
As in the previous section computeF2ViaEta seems to be the most ef- 
ficient of our algorithms to compute f2(T"Q) for high precision F, i.e. F > 2000. 
However, according to our experiments the same holds for low precisions. 



4.3 Efficient Computation of the Functions 72 and j 

Next we turn to the computation of the functions 72 and j. The result of the 
previous two sections is that we have to compare the running time of the compu- 
tation of 72 by its Fourier series of Sect. l3.2l to the computation via the defining 
equation (O using the algorithm computeF2ViaEta. This yields our algorithms 
computeGamma2ViaFourierSeries(r, E) and computeGaimna2ViaEta(r, E), re- 
spectively. The input of the algorithms is the same as in the previous sections; 
both algorithms return the value 72 (t) within the precision F. Furthermore, their 
implementation is straightforward. Practical results for C'(— 21311) are plotted 
in the appendix. It turns out that computeGainma2ViaEta is more efficient in 
practice. 

Finally we regard the computation of singular moduli. As in the case of 
the function 72 we take two approaches into account: First we use algorithm 
computeJViaFourierSeries(r, E), which implements the computation of j(r) 
by its Fourier series of Sect. rm Second we make use of the definition (E3 
and algorithm computeGcmima2ViaEta(T, E); this approach yields our algorithm 
computeJViaEta(r, E). The input of both algorithms is as above. The run- 
ning times for the class group of discriminant A = —21311 can be found 
in the appendix. Further practical results are listed in Table ^ Obviously 
computeJViaEta is more efficient in practice. We remark that computeJViaEta 
does not need any precomputation and storage of coefficients. Furthermore, it 
only requires a small amount of main memory. 

We conclude that the computation of the ring class polynomial of A = 
—21311 using our efficient algorithm computeJViaEta takes about 5 minutes 
on the SUN; hence the generation of elliptic curves suitable for use in cryptog- 
raphy and having an endomorphism ring of class number 200 is feasible even 
when using ring class polynomials. This is in contrast to the opinion in the cryp- 
tographic community that using singular moduli does not yield an algorithm 
for generating elliptic curves having complex multiplication by an order of class 
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Table 1. Total running times on the SUN of the two alternatives to compute all 
hp{A) values j{T(^a,b,c)) for (a,b,c) £ C{A), b > 0. tp denotes the running time of 
computeJViaEta, tf the running time of computeJViaFourierSeries. 



A 


h{A) 


hp{A) 


F 


tp in s 


tf in s 


tpftf 


-21311 


200 


101 


2234 


290.10 


486.86 


0.59585918 


-30551 


250 


126 


2837 


594.02 


1068.57 


0.55590181 


-34271 


300 


151 


3288 


935.46 


1765.62 


0.52981955 


-47759 


350 


176 


3982 


1808.78 


3663.97 


0.49366671 


-67031 


400 


201 


4809 


2907.84 


6235.32 


0.46634976 


-75599 


450 


226 


5331 


3792.89 


8371.53 


0.45307011 


-96599 


500 


251 


6104 


5623.53 


12827.73 


0.43838855 



number 200 and having reasonable practical running times. This becomes more 
evident when realizing that our SUN platform is far from being high-end. 
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A Running Times 

We present running times on the SUN for all our algorithms of Sect. El to com- 
pute the values f2(TQ), 72 (tq), and j(tq) for all reduced representatives 

Q = (a, 6, c) of C(— 21311) with 6 > 0. We choose A = —21311 as Z\ is the max- 
imal imaginary quadratic field discriminant of class number 200. Namely, for 
cryptographic purposes the German Information Security Agency [IRSlOOl rec- 
ommends to use imaginary quadratic field discriminants of class number > 200. 

As explained in Sect. 13. li the running time to compute the value for a reduced 
representative (a, 6, c) mainly depends on a. Hence we plot the running time as 
a function of a. We use the floating point precision F — 2234, which comes from 
formula (1^ . 
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Abstract. An algorithm given by Ambainis and Freivalds |P constructs 
a quantum finite automaton (QFA) with O(logp) states recognizing the 
language Lp = {a*| i is divisible by p} with probability 1 — e , for any 
£ > 0 and arbitrary prime p. In [3j we gave examples showing that the 
algorithm is applicable also to quantum automata of very limited size. 
However, the Ambainis-Freivalds algoritm is tailored to constructing a 
measure-many QFA (defined by Kondacs and Watrous |2]), which cannot 
be implemented on existing quantum computers. In this paper we modify 
the algorithm to construct a measure-once QFA of Moore and Crutchfield 
^ and give examples of parameters for this automaton. We show for the 
language Lp that a measure-once QFA can be twice as space efficient as 
measure-many QFA’s. 



1 Introduction 

For a background on quantum finite automata (QFA) and quantum computa- 
tion in general, see Gruska’s monograph 0. Roughly, two models of quantum 
automata are current, the measure-once model due to Moore and Crutchfield 
0 and the measure-many model due to Kondacs and Watrous 0. Essentially, 
the former differs from the latter in that the quantum measurement is done 
once only at the end of the processing of an input string. Though the limitation 
to a single measurement limits the processing power of the automaton, it also 
saves computing space - an essential practical advantage, considering that the 
quantum computers of today have no larger than a few bits register. In this 
spirit, the present paper investigates the space efficiency of QFA’s recognizing 
the languages Lp = {a*| f is divisible by p} where p is a prime integer. Our point 
of departure is an algorithm given by Ambainis and Freivalds [Q to construct 
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a measure-many quantum finite automata with 0(logp) states recognizing the 
language Lp with probability 1 — e, for any e > 0. In ^ we showed by exam- 
ples that the algorithm is applicable to quantum automata of very limited size, 
though in general not small enough to be implemented on existing quantum com- 
puters. In this paper we modify the algorithm to construct a measure-once QFA 
of Moore and Crutchfield recognizing Lp and give examples of parameters for 
this automaton. We show in particular that the language Lp can be recognized 
twice as space-efhciently by a measure-once QFA than by measure-many QFA’s. 



2 Measure-Once One-Way QFA 

A measure-once 1-way QFA 0 is a tuple U = {Q, S,S,qo,F) where Q is a set 
of states of finite cardinality n with a singled out starting state qo G Q and a 
subset F d Q oi accepting states, S is finite input alphabet with left end-marker 
symbol ^ and right end-marker symbol $ and <5 = S{q, a, q') is a unitary transition 
function 

5\ QxSxQ^C 

representing the probability density amplitude that flows from state q to state 
q' upon reading a symbol ct; the unitarity of 5 is understood as the unitarity 
of the n X n matrix fy, defined by S{q,a,q')q^gi^Q for every a G S. An input 
string X = X1X2 ■ ■ ■ Xm G S™ is read left to right; we assume all input ends 
with the end-marker $, the last symbol read before the computation terminates. 
At the end of a computation, the automaton measures its configuration and 
outputs a probability of acceptance of the input string. The computation and 
measurement are done as follows. The matrices T„ act on n-dimentional com- 
plex unit vectors \'F) G C*®, the configurations of Lf . In the orthonormal basis 
{k)}qgQ corresponding to the states q G Q oi U,one writes \F) — J2qeQ ^<1 ' I?)’ 
J2qeQ Wqf' = 1) saying that \<F) is a linear superposition of (pure) states q G Q. 
The coefficient aq is interpreted as the probability density amplitude of U being 
in state q. Then, if U is in configuration \W) and reads symbol a, it moves into 
a new configuration 



'^'^1^)= X! “9 • ■ k') ) 

q,q'€Q 

and hence, after starting in the state qo and reading a string x = X1X2 ■ ■ ■ Xm, its 
configuration becomes \Fx) = ko) ■ The measurement is then 

performed by a diagonal zero-one projection matrix P with Pqq = 1 if and only 
ii q G F\ the probability of accepting the string x is defined as 



Pm{x) = {F,\P\F,) = \\P\F,)\\\ 
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3 The Ambainis-Freivalds Algorithm for Measure-Once 
Automata 

Let p be a prime integer and consider the language consisting of all words in 
a single letter a of length divisible by p. 

Theorem 1.1 For any e > 0, there is a QFA with O(logp) states recognizing 
Lp with probability 1-e. 

Theorem 1.2 Any deterministic 1-way finite automaton recognizing Lp has at 
least p states. Generally, 1-way probabilistic finite automata can recognize some 
languages with the number of states being close to the logarithm of the number 
of states needed by a deterministic automaton; see |5j and jHI- However, this is 
not the case with Lp. 

Theorem 1.3 Any 1 -way probabilistic finite automaton recognizing Lp with prob- 
ability 5 + e, for a fixed £ > 0, has at least p states. 

Corollary 1.1 There is a language Lp such that the number of states needed 
by a probabilistic automaton is exponential in the number of states needed by a 
1-way QFA. Complete proofs of the theorems are given in In this paper we 
only relate the algorithm by which the automaton in Theorem 1.1 can be con- 
structed and which was used to find examples given bellow. First, we construct 
an automaton accepting all words in Lp with probability 1 and accepting all 
words not in Lp with probability at most 7/8. 

Let C/fc, for k G {1, ..,p — 1} be a quantum automaton with a set of states 
Q = {QOiQacc}, a starting state |go)? F = {<lacc}- The transition function is 
defined as follows. Reading a maps |go) to cos(/?|(7o) + i An ip\qacc) and \qacc) to 
*sini^|(7o) + cosf/j|gacc) where p = The superposition of Uk after reading 
is cos(^^)|(;o) + sin(^^^)|(7occ)- Therefore, the probability of Uk accepting a® 
is cos^(^^^). If j is divisible by p, then is a multiple of 27 t, cos^(^^^) = 1 
and, therefore, all automata Uk accept words in Lp with probability 1. For any 
word not in Lp, at least of all Uk reject with probability at least 
We call such Uk “good” . 

Next, we consider sequences of |"81np] k’s. A sequence is good for if at 
least I of all its elements are good for aF There is a sequence, which is good for 
all j G {1, ..,p — 1}. This sequence is good for not in Lp with j > p as well 
because any Uk returns to the starting state after and, hence, works in the 
same way on and 

Now, we use a good sequence fci, ..., fc|-ginp] to construct a quantum automa- 
ton recognizing Lp. The automaton consists of Uki, Uk 2 , ■■■ ,Uk\8inp] and a 
distinguished starting state. Upon reading the left endmarker /, it passes from 
the starting state to a superposition where \qo) of all Uki have equal amplitudes. 
Words in Lp are always accepted because all Uk accept them. Words not in Lp 
are rejected with probability at least |. Finally, we sketch how to increase the 
probability of correct answer to 1 — £ for an arbitrary £ > 0. We do it by in- 
creasing the probability of correct answer for each Uk. Namely, we consider an 
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automaton with 2“^ states where d is a constant depending on the required 
probability 1 — e. The states are labeled by strings of O's and I's of length d : 
do . ..00 , <Zo....oi and so on. The starting state is the state qo...oo corresponding to 
the all - 0 string. The transition function is defined by 

d 

5{dxi...XdT ^idyi...Vd) ~ ) n, ). 

f = l 

The only accepting state is go...oo- The amplitude of \qo) (8> 0 |qo) = <Zo...oo in 

this superposition is cos‘^(^^^). If j is a multiple of p, then this is 1, meaning 
that words in Lp are always accepted. For not in Lp, we call S - good if it 
rejects with probability at least 1 — <5. For a suitable constant d, at least 1 — d 
of all [/^ are good. Taking d = | and choosing d so that it satisfies the previous 
condition completes the proof. 

The idea of the proof may be visualised geometrically as follows. We mark 
amplitudes of |(jo) on one axis and amplitudes of |qi) on another. The automaton 
starts in superposition |go) (see Fig. ^ n°). As the letter a is read, the vector is 




Fig. 1. 



rotated by angle (p, where <p = After rotation the end of vector is located 
at point . Reading next letter a new rotation is performed and a? is reached 
and so on. It is easy to see that the vector returns to starting state after reading 
p a’s, hence, the automaton works in the same way on and and we 
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can consider j € {0,..,p— 1}. The amplitudes of jgo) and |gi) after reading 
equals to vector projections onto the axes go and gi respectively (Fig. P, dotted 
line). Every input word is accepted with probability cos^ {j(p). It equals to 1, if 
j is a multiple of p, and is less than 1 otherwise. So in the abstract Lp can be 
recognized even with one Uk, however, in fact ip approaches 0 as p grows and for 
more and more j's the number cos^(jp) is close to 1. 

The main idea of the algorithm is thus to take several ki,k 2 , fcn counter- 
vailing bad cases of one another. In the instance shown in Fig.Q /ci = 1, = 2 

and p= 5. We can see that the input word will be accepted by U 2 with large 
probability (a^ in brackets) and will be accepted by Ui with small probabil- 
ity (a^ without brackets). And vice versa with - U\ will accept it with large 
probability, but U 2 with small. Using the two automata at a time, they mutually 
correct “bad” probabilities and performs better than each of them individually. 




Fig. 2. 



4 Examples of Parameters 

Assuming the largest quantum computer built has seven bits of memory, none 
of our examples may have more than 2^ = 128 states. To implement each Uk 
two states are needed - one accepting and one simple working state. Additionally 
one starting state is necessary common for all Uk’s. So, if we arbitrary choose a 
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sequence of \Slnp\ Uk as described before, then \8lnp] <128/2 = 64 and it follows 
that p can not be greater than 2971. 

However, we can choose shorter sequences of Uk and still obtain good results. 
We denote hy tp = ^ the smallest angle by which the rotation can be performed 
after reading an input symbol /a/ and take it for unit angle. 



1 


2 




P-1 


2 


4 




2*(p - 1) mod p 










P-1 


2*(p - 1) mod p 




(p - 1)2 mod p 



Fig. 3. 



Let us examine tables of products modulo p, i.e. tables containing numbers 
(i * j) modp, where t is a number of row and j is a number of column. The 
row of the table corresponds to automaton Uk and the column of the 
row shows, by how many units the vector is turned away from the starting state 
after j input symbols are read. I.e. the 2 “""^ row describes automaton which 
superposition vector rotates by angle 2ip = 2^ (by 2 units) after reading every 
input symbol /a/, and the column in this row shows that after reading 3 
symbols faf the vector is rotated by angle 3*2^ = 6^ (6 units). For any language 
Lp (for any p) we aim to find a combination of automata Uki, Uk2, Ukn such 
that: 

1. a number of automata in combination is as small as possible; 

2 . a probability of giving correct answer is as big as possible. 

So we search for a sequence of rows ri, r 2 , .., r„ in table such that the greatest 
sum cos^(^/ * Tij) - the probability of accepting a word not in Lp - is as 

small as possible {rij is the element of the row, j G {1, . . . ,p — 1}). 

To ease the search the table can be reduced as follows. 

1. As (p — i) ■ (p — j) = p^ — ip — jp + ij = zj(niod p) the table is symmetrical 

for center, and this means that rows i and {p — i) are mirror displaying each 

other. 

Additionally, i{p — j) = ip — ij = — zj(mod p) so aj = p — Qp-j. 



ai 


a2 




3p-2 


Sp-l 


3p-i 


3p-2 




»2 


ai 



Fig. 4. 
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2. Since operation of multiplication is commutative, the table is also symmetri- 
cal with respect to main diagonal and 1®* assertion is also right for columns. 

3. It is possible to notice in figures given above that Uk accepts input words Oj 
and ttp-j with equal probabilities. Indeed, cos^(^^'^y^) = cos^(— = 
cos2(2^^). Thus we can replace all the numbers greater than by numbers 
P- j- 

4. It follows form statements 1 and 2 that the and the (p — rows will 
be identical after replacement, and the same can be said about the columns. 
For example, for p = 11 the table looks as shown in Fig. 0 
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Fig. 5. 



So it is enough to consider the part of the table with row and column numbers 
ranging from 1 to We tint cells in the matrix, containing numbers Vij that 
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Fig. 6. 



we consider “bad”, i.e. having | cos(rij'(/;) | greater than some fixed constant <5, 
and search for combinations of rows, containing no columns with number of 
tinted cells greater than ^ of number of rows. This guarantees that a sufficient 
number of U^s gives the right answer with good probability after any number 
of letters read. 

It is very important to choose right constant 5. If it is too big, amount of 
check variants increases unnecessary, if the constant is too small - it becomes 
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impossible to find any combination meeting the initial demands. In addition, 
small (5 does not guarantee that the combination found will be successful, and it 
is possible that taking greater S allows finding a better combination. 

Take, for example, p = 23 and S — 0.9. Taking k\ = 1, /c2 = 2, ^3 = 6 and 
k 4 = 10, the words not in Lp are accepted with probability at most 0.62579395. 
If 5 = 0.8, this combination turn impossible, because I'** column contains 3 
’’bad” cells. We construct a new automaton from 4 f/fc’s, by adding a common 
starting state to them. And since words not in Lp are accepted with probability 
greater than 1/2, even if p is as small as 23, we add another new state to our 
automaton, so as word is rejected with some probability without reading any 
letters. Thus the automaton accepts words not in Lp with probability less than 
1/2, but words in Lp - with probability greater than So altogether we need 
4*2+l + l= 10 states. The greatest p for which Lp can be recognized with 
deterministic or probabilistic automaton having at most 10 states is 7. However 
a QFA can be constructed for much greater p’s. 





For every prime p we search for a combination of 4 automata Uki, Uk 2 , 

Uk 4 , represented by their numbers fci, k 2 , k^ and k^ such that the greatest 
sum X)i=i cos^('0 *ki* j) {j G {1, ..., is as small as possible. The values 

were calculated by computer. Results are shown in Fig. 0 In the first column 
of table the values of p are given, the next 4 columns contain values chosen for 
fci, ^2, ^3 and ki, and the last - probability of getting a correct answer from the 
new automaton. So every row of the table means: ”If prime p is given, then 
to recognize language Lp 4 automata should be taken. The first automaton 
rotates by k\ units after reading every input symbol, the second - by ^2 units, etc. 
The automaton constructed in such way recognizes language Lp with probability 
given in last column.” 

Example, if p = 23, then the automaton composed from Ui, U 2 , Ue and C/10, 
will accept words in L23 with probability 0.687103025 and will reject words not 
in L 23 with probability at least 0.687103025. We can notice that as p grows, as 
probability of correct answer decreases. Enlarging a number of C//s can increase 
it. The table shown in Fig. E|is analogical to that in Fig.|Bl though now six Uk’ 
are combined. We can see that the probability of the correct answer is greater 
that in case of 4 k’s, but the amount of states needed is increased and we need 
6*2+2 = 14 states instead of 10. To show better the relationship between number 
of states and the probability, we use following characteristic. If n rows are chosen 
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from the table and we select an arbitrary column in these rows, then the rows 
corresponding to numbers in column, contain all the same combinations as initial 
rows. 
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Example, p = 23, we choose rows 1,2,6 and 10. Then we select any col- 
umn, let’s say 7*^. The column contains numbers 1,4,7 and 9. Now, if we take 
1®*,4*^,7*^ and row, that the columns contain the same combinations as 
rows 1,2,6 and 10 iFig. ITTIl . 

Thus we can get different automata, which have the same probability 
of correct answer but have different “worst” words. It is possible to construct 
a new automaton as combination of two such automata. The new automaton is 
two times bigger, but has better probability. 

Example, p = 1223. We use for combination the example given above, and 
after that make the combination from result of the first. Thus we make an au- 
tomaton with 18 states from two automata having 10 states each, and then an 
automaton having 34 states from two automata with 18 states. Automata con- 
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Fig. 10. 



structed in such way are far from being optimal, but this method is easy to use 
and it gives certain improvement of probability. Overall these examples show, 
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Fig. 11. 



that, by increasing number of states, it is possible to construct a quantum au- 
tomaton giving the correct answer with large probability. On the other hand, 
if we acquiesce in a smaller probability, we can reduce the size of automaton 
notably. And finally, all the examples show that quantum automata can be very 
space-efficient. The automaton in last example has 34 states, and it recognizes 
Li 223 with probability 0.65173586. At the same time, any deterministic or prob- 
abilistic automaton recognizing this language has at least 1223 states. 

We can see also that the measure — once quantum finite automata is about 
2 times more space efficient than corresponding measure — many automaton 
because of its simple construction. Indeed, every Uk for measure — once has 2 
states while for measure — many automata 4 states are needed. 
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Abstract. A mathematical model of a knowledge representation 
system (KR-system) is proposed. Its prototype is the concept of an 
information system in the sense of Z. Pawlak; however, the model is, 
in fact, a substantial extension of the latter. In our model, attributes 
may form an arbitrary category, where morphisms represent built-in 
functional dependencies, and uncertainty of knowledge is treated in 
terms of category theory via monads. Several notions of simnlation 
are also considered for such KR-systems. In this general setting, the 
semiphilosophical problem mentioned in the title, still open, is given a 
precise meaning. 

Keywords: complete information, fuzzification, knowledge representa- 
tion system, simulation, uncertainty 



1 Introduction 

1 . Z. Pawlak introduced his notion of information system in late 70’ies M- 
Various versions of this notion has been given other names; probably, the best 
known terms are ‘attribute system’ and ‘knowledge representation system’ (see 
mil). We choose here the traditional term and adopt the later one for our 
general systems in the subsequent section. 

A deterministic information system is a system of the form {Ob, At,Val, f), 
where 

— Ob is a non-empty set of objects, 

— At is a set of their attributes, 

— Val is a family {Vala,a G At) of sets: each Vala is considered as the set of 
values of a, 

— / is a function Ob x At ^ [j{Vala- a G At) such that f{o,a) G Vala, it is 
called the information function. 

Supposing that the information function here might be of type 
Ob X At ^ \^{Po{Vala). a G At) 

and satisfy the condition f{o,a) C Po{Vala), where Po{X) stands for the set 
of nonempty subsets of X, we come to a wider class of information systems 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 34- ITTO 2001. 
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which contains also nondeterministic ones. Formally, any deterministic informa- 
tion system can be regarded as a complete system of this kind with each /(o, a) a 
singletone. With this in mind, we shall further discuss only these general systems. 



2. Now a question can be posed as to whether there exist essentially indetermin- 
istic information systems. The easy answer is negative; however, it makes sense 
to state the problem more formally and to examine some details of its solution. 

Let S := {Ob, At,Val, f) and S' := {Ob' , At' ,Val' , f) be two information 
systems. Intuitively, S' contains more information than S if the objects of S 
can be modelled by those of S' in the following sense: there are 

— a mapping m: Ob — >■ PQ{Ob') representing each object from Ob as a (fusion 
of a) collection of objects of S'. 

— a function I: At ^ At' which codes each attribute of S by an attribute of 

S', 

— a family A of functions Xa- which interprets in S values of 

these codes in S', 

such that, for all o G Ob and a G At 

f{o,a) = {Aa(u^): v' G f'{o',I{a)) for some o' G m{o)} . (1-1) 

In particular, if At C At', Val'^^ C Vala, and I and Xa are all the identity 
mappings, then (EJ amounts to 

f{o,a) = |J(/'(o',a): o' G m{o)) . 

We could consider a more general concept of modelling, where I interprets 
At into Po{At') and each Xa is a function of type Y\{V al'^,\ b G I{a)) — >■ Vala- 
Alternatively, the subsets of attributes we are interested in could be included in 
At as additional elements (“complex attributes”); in doing so, some structure 
on At should be introduced. This is the way we proceed in the next section, but 
these refinements are not relevant to the present discussion. 

Our problem can now be stated as follows: 

are there information systems that cannot be modelled by a deterministic 
one? 

As we noted above, the answer is negative. Indeed, let S = {Ob, At, Val, /) be an 
information system. Put Ob' to be the set of all maps ip'. At — >■ [J{Vala'. a G At) 
such that ip{a) G Va for all a G At, and define the function /' on Ob' x At 
by f{(fi,a) := (fi{a). Then S' := {Ob' , At, Val, f) is a deterministic information 
system. Given an object o G Ob, denote by m(o) the subset {ip G Ob': ip{a) G 
f{a,o) for all a G At}. If / and all Xa are the identity selfmaps on At and Vala 
respectively, then the triple {I, X, m) shows that S is modelled by S' . 
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3. Our aim in this paper is to call attention to this “determinability” problem 
in a maximally general setting, in which case it could have a negative answer as 
well. Towards this end, we propose a rather general concept of an information 
system (renamed below a knowledge representation, or KR-system), which is 
described in terms of category theory. Consider At as a discrete category, Val as 
an At-shaped diagram in the category of sets §, and Ob as an object of S. For 
each a S At, the function / induces a family /* of mappings fa - Ob Po{Va) 
by fa{o) := /(o, a). Then { At, Val, Ob, f*) is an extremely simple example of 
an extended knowledge representation system. The basic definition (Def. is 
motivated even by likewise rearranged deterministic information systems. 

Aside from this introduction, the paper consists of three sections. In Sect. 
2, we introduce the concepts of a (deterministic) KR-system over an arbitrary 
category C and a simulation for these systems, and show that KR-systems form 
a category if simulations are taken for morphisms. We also discuss here the 
intuitive meaning of our abstract definitions. KR-systems admitting uncertainty, 
or fuzzy (in a wide sense of the term) systems are considered in Sect. 3. Formally, 
they are deterministic KR-systems over the Kleisli category of C w.r.t. some 
monad, and uncertainty becomes apparent only when they are reinterpreted 
relatively to C itself. At last. Sect. 4 contains a precise statement of the mentioned 
problem and some related preliminary results. 

In this introductory paper we primarily tried to elaborate the very concepts 
of a (model of) knowledge representation system and simulation. The presented 
results, all without proofs, are mainly technical or illustrative. The reader is 
supposed to be familiar with the basic notions of category theory, such as a 
functor, a natural transformation, a (co)limit of a functor, a monad, and is 
referred to for further information on these matters. 

2 Knowledge Representation Systems 

1. Let C be some category. 

Definition 1. By an knowledge representation system, or just KR-system (rel- 
atively to C) we mean a quadruple S (A,V, S, U), where 

— A is a small category (with the object set |A|^, 

— V is a contravariant functor from A to C that respects colimits (i.e. take 
them into limits in V (A) ), 

— S is a C-object, and 

— n is a cone (tt^: S' — ?> K(a)),a S |A|) in C to the base V. 

Let us shortly discuss the intuitive meaning of this formal scheme. 
Intuitively, the objects of A are attributes which the system S “recognises”, 
and V assigns to every attribute a a C-object V), := V{a) interpreted as the 
stock of possible values of a. An arrow from a to 6 in A indicates that a depends 
on b (see below). If & G A is the colimit of some diagram D in A, then we may 
think of the attribute 6 as a complex attribute “consisting” of the nodes of D 
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as interconnected components. It is, then, natural to regard that the value of b 
is completely determined by the values of these components. This is why V was 
required to respect colimits of A. 

As usual, attributes in A are thought of as attributes of certain objects or 
(to avoid misunderstanding of the term ‘object’ in the present context) entities. 
Following the tradition of Pawlak’s information systems, the object S should 
be considered as the object of these entities. We prefer a somewhat different 
interpretation; namely, we call S the object of knowledge states, each state being 
considered as representing a more or less complete description of an entity. Ac- 
cordingly, each value, attached to the corresponding attribute, is then thought 
of as a restricted piece of knowledge admitting a direct access “by name”. To 
put this another way, the possible queries to a KR-system are supposed to be of 
the kind “What is the value of this attribute?” . 

In this paper we do not discuss relations between a KR-system and the world 
of entities. In particular, the question whether the chosen system of attributes 
is sufficient to separate entities does not concern us here; we avoid entities from 
the following discussions at all (the only exception is at the beginning of the 
final section). 

The projection tTq provides the potential value of a (the answer to the query) 
in a given state of the system. Since 7T is a cone, if a is an arrow from a to b, 
then values of a can be calculated from those of b in virtue of the corresponding 
C-morphism W := V{a)\ V), — Va- 

What about the possibility to get to know values of several attributes simul- 
taneously? Let us call a set A of attributes compatible if there is an attribute 
b and a set of coordinated arrows to b from each a G A: then the values of all 
a’s can be consistently calculated from the values of b. More precisely, if Aa is 
the embedding functor of the full subcategory of A spanned on A into A, then 
A is compatible if and only if there is a cone from Aa to b (some authors call 
such cones cocones). We postulate that the query ’’What are the values of at- 
tributes in A?” does not make sense for the given KR-system if the set A is 
not compatible in the system. Thus the category of attributes should to be rich 
enough. 

We call the attribute category A (finitely) saturated if every (finite) compat- 
ible set of attributes is presented by a complex attribute, i.e. if every functor 
Aa with A (finite and) compatible has the colimit. In particular, then A has an 
initial element. 



2. This informal semantics becomes very suggestive in the case when C is the 
category of sets. However, the described interpretation of A-arrows then actually 
brings about the restriction that each morphism set A(A, K) should contain at 
most one arrow (see ( 12 . 21 ) below), i.e. that A should be merely a preordered set. 

Suppose that S := (A, V, S, 77) is a KR-system relatively to §. A (full) de- 
scription is defined to be a function d on |A| that assigns an element of Va to 
every attribute a so that 



d{a) = fx{d{b)) 



( 2 . 2 ) 
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for every arrow a: a ^ b. We can associate with any state s G S the description 
ds defined by ds{a) := 7Ta(s); thus, the cone P might be replaced by the family 
{ds'. s G S). Conversely, any family {dt- t G S') of descriptions gives rise to a 
KR-system (A, R, 5", il'), where 7r^(t) = dt(a). 

Now let K ■— {(a, u): a G lAI,^ G W} be the set of all pieces of knowledge 
directly available in S (see above). We say that a piece (a,u) is less informative 
than (b,v) (in symbols, (a,u) < (b,v)), if there is an arrow a: a & in A with 
u = 'fe'(t)). The relation < is a preorder on K; for the sake of the simplicity we, 
however, assume in this informal discussion that it is an order. We are going to 
show that, under certain natural conditions, the poset K can be endowed with 
a kind of entailment relation in the sense of |3|; the details, however, will be 
omitted. 

Some authors call a poset a domain if it is finitely bounded complete, i.e. 
if every finite subset of it bounded from above has the least upper bound. The 
empty subset is not excluded; so a domain has the bottom T. Now, if A is finitely 
saturated, then the poset AT is a domain. We assume that it is the case. 

Let as define the forcing relation |h on S' x AT is by 

s |b(a, u) := 7r^(a) = u . 

A simple axiom checking shows that then the pair (S, jh) is a possible world space 
for AT in the sense of |5|. Hence, the relation \= naturally defined by 

(6, v) 1= (a, u) iff Vs(s |l-(6, u) => s |b(a, u)) 

is an entailment on AT (see jSl Proposition 3]). 

If A is even saturated, then the domain K is bounded complete, hence, a 
meet semilattice, and can be equipped with the knowledge revision operation 
along the lines of Sect. 6 in 0. 

3. Next we describe the concept of simulation for our abstract KR-systems. 

Definition 2. Let S := (A,R, S, 7T) and S' := (A', W, S' 77') be two KR sys- 
tems. A simulation of S into S' is a triple m := (7, A, m), where 

— I is a functor A — >■ A' that preserves existing colimits, 

— X is a contravariant natural transformation V'l — >■ V, viewed as a family of 
C-morphisms 

(^a- ^/(a) W, aG |A|) 

such that every arrow a: a ^ b from A yields the identity 

V{a)Xk = XaV'{I{a)) , (2.3) 

— m is C-morphism S — >■ S' such that 

TTa = Aa m (2.4) 



for all a G A. 
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We say the S' simulates S if there is a simulation of S into S' , and that the 
systems are equivalent if each of them simulates the other. Of course, equivalent 
systems need not be isomorphic. 

Proposition 1. KR- systems and their simulations make up a category, the 
composition of simulations being defined componentwise. 

We denote this category by KR(C), or just KR if misunderstanding is not 
likely. Given a subcategory % of KR, we call a KR-system S from % 

— reduced in % if the cone 77 is separating: any two parallel 3C-morphisms ip 
and tp to S coincide whenever iTaif = T^aip for all a S |A|, 

— universal in JC if every KR-system from 3C is simulated by it. 

Note that a universal system in this sense may be not universal in the sense of 
category theory: it need not be a terminal object of DC. One reason for this is 
that a terminal KS-system is always reduced. However, all universal systems are 
equivalent. 



4. We are now going to isolate certain interesting subcategories of KR in which 
the definitions of a KR-system and simulation can be considerably simplified. 
(Some special categories of Pawlak’s information systems are studied in |l 4j.) 

A frame over C is defined to be a pair (A, V) satisfying the two first axioms 
of a KR-system. In particular, the pair consisting of the first two components of 
a KR-system S is said to be the frame of S. Clearly, every frame is the frame 
of an appropriate KR-system. 

We denote by KR[A, K] the full subcategory of KR consisting of all KR- 
systems having a fixed frame (A, K), and of those simulations in which I is the 
identity functor and A is the identity transformation. Then holds trivially. 
We may regard the objects of KR[A, V] to be just pairs {S, 77) satisfying the last 
two axioms of a KR-system, and morphisms to be just C-morphisms m satisfying 
the correspondingly reduced version of (El, i.e. the condition tt^ = 7t(j m. The 
following simple category-theoretic observation is useful. 

Proposition 2. The following statements are equivalent for a KR-system S := 
(5,77) from KR[A,K].- 

(a) S is terminal, 

(b) S is universal and reduced, 

(c) S is a limit of the functor V , and 77 is the corresponding limiting cone. 

Therefore, if the category C is small-complete, then each KR[A, V] has a ter- 
minal system. For example, if C = § and {dg: s G 5) is a family of all descriptions 
(relatively to (A,K)), then the KR-system (5,77), where each tTq is defined by 
7Ta(s) := dg{a), is terminal. 
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3 Uncertainty in KR-Systems 

1 . KR-systems, as they were defined in the preceding section, seem to be de- 
termined: every state completely determines values of all attributes. We shall 
show in this section that KR-systems with uncertainty nevertheless fall under 
the same formal scheme. 

To this end, we fix some monad F := over C. The functor F is 

interpreted as fuzzification (in a wide sense of the term) . Thus, if C is the category 
of sets, the object F{X) consists of “fuzzy subsets” of X interpreted as available 
“vague pieces of information” about, or “vague specifications” of, elements of X, 
while the function F{(p), where Lp itself maps a set X into Y , transforms a vague 
specification of, say, an element x € X into the corresponding vague specification 
of (fi{x) G Y. The function rjx ■= vi^)- ^ F{X) associates with each element 
of X a “crisp specification” of this element; so such a specification is a particular 
case of vague ones. At last, the function px ■= m(A'): F{F{X)) — >• F{X) provides 
completeness of F{X) in the sense that a “vaguely specified vague specification” 
over X always is a vague specification over the same X. 

A useful derived notion related to monads is that of an extension map. Given 
a C-morphism <p: X ^ P{Y), its extension F{X) — >■ F{Y) is defined to 
be the composition pyF{‘p). Conversely, F{(p) = ( 77 ^ 93 )^* and p,x = {sp(^x))K 
where £y stands for the identity morphism of Y. One may view a C-morphism 
tp: X ^ F{Y) as an A-indexed family of specifications over Y, and a vague 
specification over A as a specification over this family. Then (p^ converts such 
a vague specified member of the family into a specification over Y. Thus the 
extension map j) is a counterpart of p; in fact, the monad could be replaced by 
the triple (A, ry, H), where F is now treated merely as an object map. 

This interpretation of monads is detailed, and an appropriate general math- 
ematical theory of fuzzy sets is developed, in |B|. In particular, it is shown in 
Sect. 1 of 0 that crisp, probabilistic, possibilistic and Zadeh’s fuzzy set theories, 
as well as some other models of uncertainty, all come under this pattern. The 
main result of [i^ is that so does the theory of Goguen’s L-fuzzy sets. These ideas 
come back to ^ brief account of them is given in the review |C]. See, however. 
Sect. 7.2 of PI for some criticism on, and modification of, this approach. 

Following jSJ Def. 4.4], we call the monad F consistent if the functor F is 
faithful or, equivalently, if all the morphisms r]x are monomorphisms. A monad 
is interesting as a tool for fuzzification only if it is consistent (in S, if F is 
inconsistent, then no object F{X) contains more than one element). Accordingly, 
in what follows we assume that F is consistent. 

A morphism from A to F(Y) in C may be interpreted also as a fuzzy relation 
between A and Y. Any morphism y: A — >■ A gives rise to such a relation •= 
rjY p- We recall that the Kleisli category of F is the category Cf with the same 
object class as C and morphism classes Cf(A, A) := C(A, F(A)). Gomposition is 
given in Cf by t/" o := ip'^p . Glearly, Cf reduces to C if F is the trivial monad. 
There is a simple functor b: C — >■ Cf defined by 

b(A) := A, b(x) := y' . 



(3.5) 



Are There Essentially Incomplete Knowledge Representation Systems? 101 



It follows from Theorem V.6.2] that the functor preserves colimits, for it has 
the left adjoint ft: Cf — C which takes any Cp-object X into F{X) and each 
Cp-morphism ip into (|B1 Theorem VI. 5.1]). 

2. Now we turn to the category KR(Cp). Its objects, KR-systems over Cp, will 
be reinterpreted as generalised KR-systems over C. This means, first of all, that 
we still shall regard Va rather than F( 14) to be the stock of values for a. While a 
KR-system in the previous sense (i.e. an object of KR(C)) can be regarded as as 
a system with complete information, there are two different ways in which un- 
certainty may appear in such a generalised system S := (A, V, S, U). First, each 
TTa, being a C-morphism S — )> F{Va), now provides vague information about the 
values of the attribute a; this might mean that knowledge states in S, generally, 
do not contain full information about entities (another possible explanation is 
discussed in the next section). Second, values of a need not be completely de- 
termined any more by those of b when there is an arrow from a to b: instead of 
crisp functional dependencies between attributes fuzzy dependencies now may 
appear. 

For all that, this vagueness is apparent in some cases, as every KR(C)-system 
T (A, U, S, P) can be identified with the KR(Cp)-system := (A, \>U, S, P^), 
where \>U is the composition of U with the functor b defined in li:i.t)l) . and P^ 
is the cone (p^: S — >■ bC/(a)) (see Theorem [D below) . Recall that b preserves 
colimits; hence, so does bC/ as well, and T'’ is indeed a KR-system. 

Definition 3. A KR(Cp)-sysfem S is said to be (information) complete if it can 
be presented as for some T from KR(C), and incomplete if otherwise. 

3. We now give an example of an extended KR-system which was studied in 
other connection in HE!. 

Let Pq be the (weak) powerset endofunctor of 8 defined as follows: If M is 
a set, then Pq{M) stands for the set of non-empty subsets of M, and if / is a 
function M ^ N, then Po{f) stands for the mapping Pq{M) — Pq{N) defined 

by 

PoimX) :={f{x):xGX} . 

In fact, this endofunctor induces a monad P* := {Po,T],p), where ijm takes 
every element x of M into the singleton {x}, and pM takes a non-empty set U 
of non-empty subsets of M into the union of U. 

Let X and Y be nonempty sets, and let X* and Y* be the set of all words 
(finite strings) over X, resp. Y. We first consider X* and Y* as posets, where 
p < q means that the word p is an initial segment of q. Hence, X* may be viewed 
as a category in the usual fashion. This gives rise to a functor V: X* — >■ 8, the 
set Y^P^ of all words from Y* of the same length as p being assigned to a word 
p, and the mapping inif: Y^'^'> — >■ Y^p'> such that inif{b) is an initial segment of 
b, to an arrow p ^ q. 

A sequential ND-operator over [V, V] (see jlTl Def. II. 4.3]) is defined to be a 
mapping ip-. X* — >■ Pq{Y*) subject to the axioms 
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- (VgGX*)<^((7) 

- (Vp e X*)(Va G p(p))(Vg GX*){p<q^ {3b G F*) (a < 6&6 G ip{q))), 

- (Vg G X*){yb G p(g))(Va G F*) (a < 6 ^ (3p G X*) (p < g&a G (p(p))). 

According to Theorem II.4.1 in every such an operator characterises be- 
haviour of an appropriate initial non-deterministic automaton (and conversely). 
Let {(fis, s G S) he a family of sequential ND-operators, and assume that, for each 
q G A*, the function ng-. S — >■ Po{Y'^) is defined by TTq{s) := ips{q)- The family 
n of all such functions is a cone from S to the functor Pg V in the category of 
sets. Now, the system {X* ,V, S, II) is a knowledge representation system with 
respect to the monad Pg. It can be shown that this notion is equipollent with 
that of non-deterministic automaton with a fixed subset of initial states. Note 
that its attribute system X* is finitely saturated (in fact, even saturated). 

4. The notion of simulation has also extended in KR(Cjr). Indeed, related back 
to C, a simulation of S into S' consists of a functor J: A — >■ A', a natural C- 
transformation A: V'l — >■ FV, and a C-morphism m: S' — >■ P(S'). Then the 
identities (O) and dZil characterising simulations take the form 

{V{a)f \b = \JV'{I{a)) and T^a = >^J m , 

respectively. We are now going to pick out some simulations of a special type. 

Let S ■— (A,F, S, P) and S' := {A' ,V , S' , II') be two KR-systems from 
KR(C][r). Suppose we are given a triple n := (/, k, n), where / is a functor A — ^ A', 
is a contravariant natural transformation V'l — >■ V and n is a C-morphism 
S — >■ S'. Suppose, furthermore, that X = (i.e. for all a G |A|) and 

m = n*'. Then A is a contravariant natural C-transformation V'l — >■ FV and m 
is a C-morphism S — >■ P(S). We denote the derived triple (/, A, m) by •n}’. Note 
that due to consistency of F, the triple n is uniquely determined by . 

Definition 4. The simulation m := (/, A, m) is said to be strict if it can be 
presented as n} for some triple n. 

It follows that S is simulated by S' iff S is strictly simulated by the system 
S'^ := (A',F',P(S"),P'“), where stands for the family «*,a G A'). 

The following technical lemma gives a useful independent characteristic of 
strict simulations. We note that the consistency of F is essential for the ’if’ 
direction in it. 

Lemma 1. nf is a simulation of S into S' if and only if 

V{a) Xb = F{>Ca)V'{I{a)) and iTa = F{xa) n . (3.6) 

For example, every KR(C)-simulation induces a strict KR(C]F)-simulation. 
Suppose that T := {A,U,S,P) and T' := {A' ,U' , S' , P') are KR-systems from 
KR(C) and that n := (I, x, n) is a simulation of T into T' . Then it follows from 
the lemma that the triple rf’ is a strict simulation of into T' . Every strict 
simulation between complete KR-systems arises this way. 

Observe that strict simulations constitute a subcategory of KR(Cf) with the 
same object class. Let us denote it by KRs(Cf). 
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5. We sum up a part of the above discussion as follows. 

Theorem 1. (a) Complete KR-systems and their strict simulations make up a 
subcategory KRc(Cf) o/KRs(Cf). 

(b) The transformations T i-T T°,n i— >■ rf' yield a faithful functor KR(C) — >■ 
KRs(Cf). In fact, it is an isomorphism o/KR(C) onto KRc(Cf). 

To underline the described reinterpretation of the category 3C1 R(C]f) as the 
category of fuzzy KR-systems relatively to C, we rename it into KR’^(C). More- 
over, the category C will be supposed to be fixed in the rest of the paper, and 
we usually shall omit the reference to it in notation of this kind at all. 

We retain the previous definitions of a reduced, universal and terminal sys- 
tem and a frame. Needless to say, all the observations made at the end of the 
preceding section hold true in KR®'. As to the available pieces of knowledge (when 
C = 8), there are good reasons to consider now the wider set 

:= {{a, a): a G |A|,cr G F{Va)} 

rather than K . In the light of the short discussion on K in Sect. 2.2 , investigating 
the structure of AT*' seems to be of interest. 



4 Is an Incomplete KR-System Substantially Incomplete? 



1. In general, there is a suggestive analogy between KR-systems, on the one 
hand, and quantum systems on the other: observables of a quantum system are 
replaced by the attributes in a KR-system, and the probabilistic distributions 
on the value set of an observable is a kind of vague specifications on the set. 
Such an analogy between automata and quantum systems has been noticed as 
early as in COI; see also f4ll7llS| . 

The famous hidden variables problem in quantum mechanics arises from at- 
tempts to interpret quantum systems classically. The indeterminateness of such 
a system could be explained this way: its states are thought of as probability 
distributions on some hypothetical set of sharp, or deterministic, states, each of 
which completely determines values of all observables. The essence of the prob- 
lem, then, is to show that it is always possible to find additional parameters (the 
hidden variables) which could operationally describe these deterministic states 
and, hence, help to restore classical physics. It is known that the hidden vari- 
able problem in quantum mechanics is solved negatively. See, e.g., HZ! for more 
information. 

One could try to explain incompleteness of a particular KR-system after 
the same fashion. Our experience with Pawlak’s information systems in Sect. 1 
shows that it may well be that an entire class of KR-systems admits a uniform 
way to introduce “hidden variables”. On the other hand, experience of quantum 
mechanics holds us back from being too optimistic in this respect. Moreover, 
we should take into account the possibility that a KR-system with uncertainty 
(traditionally called incomplete) is, in fact, complete in the sense that each of its 
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states provides maximally full information about the corresponding entity: they 
could be the entities themselves which actually have uncertain values of their 
attributes. 



2. We now move from this informal discussion to more precise formulations. 

Let X be some subcategory of KR*^. When speaking on simulation of a X- 
system, we shall always have in mind its 3C-simulation (i.e., by another 3C-system, 
and via a simulation from 3C) . 

Definition 5. A KR-system from X is said to be weakly complete in X if it is 
simulated by a complete system. We say that X has enough complete systems, 
or is uncertainty-free if every KR-system in X is weakly complete. 

For example, the category of Pawlak’s information systems has enough com- 
plete systems. Now we can state our principal question as follows. 

Problem 1. Has the category KR®' enough complete systems? 

If it is not the case, then two other problems arise: 

Problem 2. Characterise the uncertainty-free subcategories of KR’^. 



Problem 3. Find interesting examples of subcategories of KR®' that have not 
enough complete systems. 

The following observation on categories with universal systems is helpful 
when C is small complete. 

Proposition 3. If the category X has universal systems at all, then it is uncer- 
tainty-free if and only if it has a complete one. 

A bit more can be said about the category of KR-systems generated by a 
frame. 

Let (A, R) be some frame relatively to Cp. We assume that there is also a 
frame (A, U) relatively to C such that V = \>U . Obviously, this condition is nec- 
essary and sufficient in order that KR’^[A, C] could have complete KR-systems. 
Note that a system S is universal in KR”^[A, K] iff is (see the paragraph just 
after Def. 4 for this latter notation). 

Theorem 2. The following holds for the category KR’^[A, V]: 

(a) z/KR’^[A, K] has enough complete systems, then it has an universal system 
if and only if so does KR[A, [/], 

(b) if S and T are universal systems in KR*'[A,K] and KR[A, K] respectively, 
then KR®'[A,y] has enough complete systems if and only if S is equivalent 
to t\ 
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Abstract. This paper presents the results of using sequential analysis 
to find increment sequences that minimize the average running time of 
Shellsort, for array sizes up to several thousand elements. The obtained 
sequences outperform by about 3% the best ones known so far, and there 
is a plausible evidence that they are the optimal ones. 



1 Shellsort 

A well implemented Shellsort is among the fastest general algorithms for sorting 
arrays of several dozen elements, and even for huge arrays it is not prohibitively 
slow. Moreover, it is an adaptive method that runs faster on “nearly sorted” 
arrays that often occur in practice. Published by D. L. Shell in 1959 El , it is 
one of the earliest sorts discovered, it can be easily understood and implemented, 
yet its analysis is difficult and still incomplete. 

Shellsort for N elements X[0, . . . , iV — 1] is based on a predetermined sequence 
of integer increments 0 < ho, . . . , ht-i < N, where /iq = 1. In general, the 
increment sequences can change with N, but customarily initial elements of 
some infinite sequence are used for simplicity. 

The algorithm performs t passes over the array: one pass for each increment 
from ht-i down to /iq. The pass number t — k sorts by straight insertion all the 
subarrays that consist of elements hk apart: X[0, hk, ■■ .],X[l,hk + I, , 

X[hk — 1,2/ife — 1, . . .]. This way each pass involves sorting subarrays that are 
either small or nearly in order, and straight insertion sort performs well in these 
circumstances. 

The number of operations made by the algorithm depends on the incre- 
ment sequence, and indeed many sequences have been proposed and used. Ref- 
erences 0H3 contain broad surveys of previous research on Shellsort. Theoreti- 
cal analysis of its running time is, however, confined to worst case. The average 
running time was susceptible to analysis only in cases that do not cover the 
sequences used in practice i2Eini. Also, sequences of increments that minimize 
the average running time of Shellsort were not known so far. 

2 Sequential Analysis 

Sequential analysis is a method of verifying statistical hypotheses developed by 
Abraham Wald in the 1940s m- Whereas classical statistical criteria fix the 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 100- 17771 2001. 
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size of the random sample before it is drawn, in sequential analysis its size is 
determined dynamically by analyzing a sequentially obtained series of data. 

Sequential analysis has been employed in fields, where sampling is costly, 
for example drug investigation and destructive qualification testing of goods. 
We use it to determine in reasonable time the best increments for the average 
case of Shellsort. For example, suppose that we are interested in minimizing 
C, the number of comparisons made by a five-increment Shellsort when sorting 
128 elements. A good fairy tells us that there are only a few dozen sequences, 
for whose the average number of comparisons EC is less than 1005, and the 
distribution of C for all of them, being inherently discrete, can be approximated 
by the normal distribution with a standard deviation SC ~ 34. After all, EC < 
1005 surely causes SC to be < Cmax = 40. 

Using this information, we can construct a sequential test that acts like a low- 
band filter and allows to shorten the time of computations by a factor of thou- 
sands. We are willing to accept a good sequence when its EC < 9q = 1005 and 
reject a bad one when, say, EC > 9i = 1015. We consent to accidental rejecting a 
good sequence with probability a = 0.01 and accidental accepting a bad one with 
probability (i = 0.01. With each sequence probed, we are running Shellsort on 
randomly generated permutations and summing c^, the number of comparisons 
made in the tth trial. We prolong the test as long as 



Ofc = 



9i-9o 1-a 2 - ^ 9i- 

1 — 1 



In • 



^max 



= Tk- 



If the sum is less than Uk, the sequence is accepted; if it is greater than rk, the 
sequence is rejected; if it is between ak and r^, k gets incremented, ak and rk 
are adjusted, and another trial is made. 

The sequences that passed the test have biased estimate of the average num- 
ber of comparisons, so it has to be evaluated again on some number of indepen- 
dent permutations, but the vast majority of sequences (those with large EC) is 
rejected in the test after just a few trials. Fig. 1 shows the mean and standard 
deviation of the number of comparisons made by Shellsort using 790 sequences 
that passed the test described above, evaluated subsequently in 10000 trials. 

To make the sequential test fast, it is essential to choose 9q near the actual 
minimal EC. To estimate EC and SC of the best sequences in default of a fairy, 
we can use a random sample of sequences that begin with increments known to 
be good (more on good increments below). 

If our assumptions are true, we should have missed no more than 1% of se- 
quences with EC < 1005. In fact, the distribution of the number of comparisons 
is not normal, but skewed (see Fig. 2 for a typical example). In an attempt to 
compensate this asymmetry, the chosen value of dmax is greater than the actual 
standard deviations. Moreover, by the Central Limit Theorem, the sum of Ci’s 
obtained in independent trials tends to the normal distribution. 

This reasoning can seem fallacious, as it involves knowing in advance, what 
we are looking for [Jj but in fact it is not. If there exists a sequence with EC < 9q 
and SC > a, the probability that it was accidentally rejected is less than 1/2 
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28 I 1 ^ ^ ^ 1 ^ ^ 1 

1002 1004 1006 1008 1010 1012 1014 1016 1018 

mean 

Fig. 1. Mean and standard deviation of the number of comparisons in Shellsort. Five- 
increment sequences for sorting 128 elements that passed the sequential test are shown 




comparisons 



Fig. 2. Distribution of the number of comparisons in Shellsort using the sequence 
(1,4,9,24,85) for sorting 128 elements {solid line), and the normal distribution with 
the same mean and standard devistion {dashed line) 
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in the symmetrical distribution model, and still less than one for any skewed 
distribution. Therefore, when the search is independently run several times, the 
sequence should pass the test at least once. It never happened in author’s search: 
the standard deviation of the best sequences was always similar. 



3 The Dominant Operation in Shellsort 

Almost all studies of Shellsort treat its running time as proportional to the 
number of element moves. It is probably because the number of moves can be 
expressed in terms of the number of permutation inversions, and there are known 
techniques for inversion counting. These techniques give satisfactory answers for 
algorithms like insertion sort, where the ratio of the number of comparisons to 
the number of moves approaches 1 quickly. 

In Shellsort, the picture is different. Figures 3 and 4 show the average num- 
ber of computer cycles per move and per comparison for 10 < N < 10®. 
They concern Knuth’s implementation of Shellsort for his mythical computer 
MIX and several increment sequences: Hibbard’s (1,3,7,15,...) 0, Knuth’s 
(1,4,13,40,... I 2hk < N) 0, Tokuda’s {hk = [(gd)'^ - 4)/5l | \hk < 
N) P21) Sedgewick’s (1,5,19,41,...) 0, Incerpi-Sedgewick (1,3,7,21,...) 0, 
and (1, 4, 10, 23, 57, 132, 301, 701) (up to N = 4000) [see below]. Knuth’s discus- 
sion assumes that the running time can be approximated as 9 x number of moves, 
while Figures 3 and 4 show that for each sequence the number of key compar- 
isons is a better measure of the running time than the number of moves. The 
asymptotic ratio of 9 cycles per move is not too precise for N < 10®, and, if some 
hypothetical sequence makes 0(7Vlog A^) moves, it is never attained. Analogous 
plots for other computer architectures would lead to the same conclusion. 

Treating moves as the dominant operation leads to mistaken conclusions 
about the optimal sequences. Table 1 leads us to believe that the move-optimal 
sequence is Pratt’s one (1,2,3,4,6,8,9,12,...) = {2^3*^} 0, with 0(log^ A^) 
passes. However, the best practical sequences known so far are approximately 
geometrical, so they make 0(logAl) passes. Also, a recent result [B| is that if 
there is a sequence that yields Shellsort’s average running time ©(A^logA^), it 
has to make precisely 0(log77) passes. 

Compare-optimal sequences seem to make 0{\ogN) passes. It is illustrated 
in Tables 1 and 2 that show the best sequences of various length for sorting 
128 elements, obtained in the above described way with respect to the average 
number of moves and comparisons. In Table 1, there are no empirical sequences 
with more than 12 increments, since finding them would take too much time, 
but hopefully the point is clear. In Table 2, the difference between the last two 
lines is within possible error; sequences with more than six increments do not 
improve the results. 

In fact, concentrating on the number of moves, we can obtain sequences that 
are good for practical purposes, but we have to guess an appropriate number 
of passes for a given N, lest they be too ‘stretched’ or too ‘squeezed.’ When 
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Fig. 3. Average MIX computer time per move in Shellsort using various sequences of 
increments 
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Fig. 4. Average MIX computer time per comparison in Shellsort using various se- 
quences of increments 
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Table 1. Move-optimal sequences for sorting 128 elements 



Increments 


Moves 


Comparisons 


MIX time 


1 


4064.0 


4186.8 


37847 


1 8 


1280.2 


1506.2 


13958 


1 4 17 


762.80 


1090.69 


10422.4 


1 4 9 24 


588.25 


1018.74 


9956.9 


1 3 7 15 35 


506.56 


1032.39 


10256.0 


1 3 7 12 20 51 


458.18 


1071.99 


10761.8 


1 3 4 10 15 28 47 


427.43 


1151.14 


11625.6 


1 2 5 7 13 22 33 56 


405.20 


1220.71 


12393.3 


1 3 4 6 11 18 26 35 56 


389.36 


1308.48 


13323.4 


1 2 3 5 8 12 18 27 41 75 


375.70 


1390.80 


14301.2 


1 2 3 5 8 12 18 27 38 59 84 


365.83 


1440.45 


14717.1 


1 2 3 4 6 11 14 18 27 37 62 86 


357.63 


1545.17 


15793.3 


1 2 3 4 6 8 9 12 16 18 .. .96 108 


338.08 


2209.59 


22700.4 



Table 2. Compare-optimal sequences for sorting 128 elements 



Increments 


Comparisons 


Moves 


MIX time 


1 


4186.8 


4064.0 


37847 


1 9 


1504.6 


1280.7 


13945 


1 4 17 


1090.69 


762.80 


10422.4 


1 4 9 38 


1009.75 


598.90 


9895.0 


1 4 9 24 85 


1002.22 


538.06 


9919.9 


1 4 9 24 85 126 


1002.25 


535.71 


9933.2 



working with comparisons, at least choosing the number of passes too high does 
no harm. 



4 Further Enhancements to the Method 

The investigations were begun for small N and a broad range of sequences with 
2 < < 10 and 1 < hk/hk-i < 4 for A: > 1. It turns out that the best 

sequences had hi G {4,5} and hk/hk-i G (2,3), for A: > 0; except perhaps 
the last increments, where a larger value of hk/hk-i is sometimes better. Also, 
having hk+i mod = 0 is always a hindrance. 

The smallest increments are stable among the best sequences with a maximal 
t for various N. Indeed, for N greater than a few dozen, the best sequences are 
(1,4,9, . . .), (1,4, 10, . . .), (1,4, 13, . . .), (1,5, 11, . . .). A few other beginnings are 
also not bad, yet consistently worse than these four. The increment /13 crystal- 
lizes when N « 100, and the top sequences are (1,4,9,24,...), (1,4,9,29,...), 
(1,4,10,21,...), (1,4,10,23,...), and (1,4,10,27,...). 
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As N grows, the feasible values of the remaining increments show up, too: 
given {ho, ■ ■ ■ ,hk-i), there is always a small set of good values for hk- Fig- 
ures 5 and 6 show the average number of comparisons made by sequences be- 
ginning with (1,4,10,23) and (1,4,10,21) when sorting 300 and 1000 elements 
as a function of /14. 




Fig. 5. Performance of Shellsort that uses the sequences (1, 4, 10, 23, / 14 , /is) and 
(1, 4, 10, 21, / 14 , /is) for N — 300 depending on /14 



Thus, we can speed the search up, considering only the sequences beginning 
with these increments, and imposing more strict conditions on the remaining in- 
crements. The author would like to stress that he took a conservative approach 
and checked a much wider fan of sequences. It was seeing some patterns con- 
sistently losing for several N's that encouraged him not to consider them for 
a greater number of elements. 




Fig. 6 . Performance of Shellsort that uses the sequences (1, 4, 10, 23, / 14 , / 15 , /le) and 
(1, 4, 10, 21, / 14 , / 15 , ho) for N = 1000 depending on /14 
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5 The Results 

The size of this paper limits the author to present only a digest of his numerical 
results. Table 3 shows the best sequences of length 6-8 for sorting 1000 elements, 
and, for some increments, the best sequences beginning with them. The omitted 
sequences differ only with the biggest increments from those shown. However, 
there is a chance that some sequences that should be on these lists were acci- 
dentally rejected in the sequential test. 



Table 3. The best 6-, 7-, and 8-pass sequences for sorting 1000 elements 



Increments 


Comparisons 


1 4 13 32 85 290 


13059.0 ± 195.9 


1 4 13 32 85 284 


13060.4 ± 196.3 


1 5 11 30 81 278 


13061.5 ± 198.2 


1 5 11 30 81 277 


13063.1 ±201.2 


1 4 10 23 57 156 409 


12930.4 ± 157.5 


1 4 10 23 57 155 398 


12931.7 ± 157.5 


1 4 10 23 57 156 401 


12932.4 ± 157.6 


(21 seq. omitted) 




1 4 10 21 57 143 390 


12936.8 ± 157.9 


(14 seq. omitted) 




1 4 10 21 56 125 400 


12938.5 ± 157.0 


(22 seq. omitted) 




1 4 9 24 58 153 396 


12940.3 ± 158.9 



1 4 10 23 57 156 409 995 
1 4 10 23 57 156 409 996 
1 4 10 23 57 155 393 984 
(98 seq. omitted) 

1 4 10 21 56 135 376 961 
(18 seq. omitted) 

1 4 10 21 57 143 382 977 
(735 seq. omitted) 

1 4 10 23 61 154 411 999 
(366 seq. omitted) 

1 4 10 23 58 135 388 979 
(278 seq. omitted) 

1 4 9 24 58 153 403 991 



12928.9 ± 158.1 

12929.0 ± 157.2 

12929.1 ± 156.9 

12931.9 ± 155.8 

12932.1 ± 156.2 

12936.6 ± 159.9 

12937.9 ± 155.5 

12938.6 ± 158.1 



As the greatest increments play a minor role in the overall performance of 
a sequence, the author abandoned the idea of finding truly optimal sequences 
up to the greatest increment for greater N, and concentrated instead on finding 
the feasible values of the middle increments. 

To this end, 6-increment beginnings perform best for N = 1000,2000,4000 
were selected. For each of them, all the sequences with hg G (2/15,3/15) and 
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h-j G (2hQ,3he) were generated. For each of the 8-increment beginnings ob- 
tained, 100 random endings hs G (2/17,8/17), hg G (2/is, min(3/i8, 8000)) were 
drawn. The sequential test was then run for each 10-increment sequence for 
N = 8000. The percentage of (/ig, hg) endings that passed the test was recorded 
for each (hg , . . . , / 17 ). The top 8-increment beginnings were re-examined in 10000 
independent tests. 

Table 4. The best 8-increment beginnings of 10-pass sequences for sorting 8000 ele- 
ments 



Increments 



Ratio passed 



1 4 10 23 57 132 301 758 


0.6798 


1 4 10 23 57 132 301 701 


0.6756 


1 4 10 21 56 125 288 717 


0.6607 


1 4 10 23 57 132 301 721 


0.6573 


1 4 10 23 57 132 301 710 


0.6553 


1 4 9 24 58 129 311 739 


0.6470 


1 4 10 23 57 132 313 726 


0.6401 


1 4 10 21 56 125 288 661 


0.6335 


1 4 10 23 57 122 288 697 


0.6335 



From limited experience with yet larger array sizes, the author conjectures 
that the sequence beginning with 1, 4, 10, 23, 57, 132, 301, 701 shall turn up opti- 
mal for greater N. 



6 Why Are Some Sequences Better than Others 

It seems that some sequences are better than others on the average not only due 
to a good global ratio hk+i/hk, but also because they cause less redundant com- 
parisons, that is comparisons between elements that have already been directly or 
indirectly compared, which depends on local interactions between passes. Tables 
5 and 6 show the average number of comparisons Cj and redundant comparisons 
Ri in each pass for two increment sequences. 

Some heuristic rules on good sequences can be deduced from the observation 
that the subarrays sorted in each pass are quite well ordered and the elements 
move just a few h on the average in each /i-sorting, as exemplified in Fig. 7. 

Let’s consider hk+i expressed as a linear combination with integer coefficients 
of hk-i and hk- Patterns like these on Figures 5 and 6 can be forecasted to some 
extent using the following rule: if there is a solution of the Diophantine equation 
hk+i = ahk + bhk-i with a small value of |6|, say < 5, then this hk+i is certainly 
not a good choice. The value of b in analogous expressions with small multiples 
of hk+i on the left side is nearly as much important. 

The distribution of the distance travelled in a given pass is similar for all the 
elements far enough from the ends of the table. The order of two elements that are 
hk+i apart after /i^+i-sorting is known. In subsequent pass both elements move 
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Table 5. Average number of comparisons and redundant comparisons for Shellsort 
using the sequence (1, 4, 10, 23, 57, 132, 301, 701, 1750), for N = 40, 400, 4000 



hk 


Ci 


Ri 


23 


IT 


0 


10 


41.3 ± 2.6 


0 


4 


62.4 ± 5.5 


3.3 ± 2.3 


1 


90.8 ± 10.5 


25.0 ± 7.6 


S 211.5 ± 12.1 


28.3 ± 8.7 



hk 


Ci 


Ri 


301 


99 


0 


132 


334 ± 6 


0 


57 


543 ± 14 


0 


23 


720 ± 27 


2.6 ± 2.3 


10 


792 ± 38 


41 ± 14 


4 


809 ± 33 


158 ± 24 


1 


976 ± 41 


354 ± 33 


S 4274 ± 73 


557 ± 51 



hk 


Ci 


Ri 




1750 


2570 ± 10 


0 




701 


5200 ± 40 


0 




301 


6740 ± 70 


0 




132 


7720 ± 110 


7± 


4 


57 


8410 ± 180 


90 ± 


20 


23 


9020 ± 190 


370 ± 


50 


10 


8570 ± 130 


840 ± 


60 


4 


8310 ± 100 


1880 ± 


90 


1 


9830 ± 140 


3700 ± 


120 


S 66370 ± 430 


6890 ± 220 



Table 6. Average number of comparisons and redundant comparisons for Shellsort 
using Knuth’s sequence (1, 4, 13, 40, 121, 364, 1093), for N = 40, 400, 4000 



hk 


Ci 


Ri 


13 


36.4 ± 1.9 


0 


4 


74.0 ± 7.1 


0.02 ± 0.2 


1 


98.4 ± 13.0 


22.7 ± 9.2 


i; 208.8 ± 15.5 


22.7 ± 9.2 



hk 


Ci 




Ri 


121 


401 ± 


7 


0 


40 


713 ± 


21 


0 


13 


984 ± 


51 


53 ± 16 


4 


1223 ± 116 


164 ± 54 


1 


1092 ± 


47 376 ± 44 


i: 4413 ± 153 593 ± 81 



hk 


Ci 




Ri 


1093 


4480 ± 


30 


0 


364 


7430 ± 


50 


0 


121 


9850 ± 


170 


560 ± 40 


40 


11930 ± 


310 


1470 ± 120 


13 


14860 ± 


930 


2160 ± 280 


4 


15050 ± 


770 


3530 ± 460 


1 


11020 ± 


40 


4100 ± 100 


i: 


74620 ± 1800 


11510 ± 780 



1 

0.1 
0.01 
0.001 
0.0001 
1e-05 
1e-06 
1e-07 
1e-08 
1e-09 
1e-10 

-20 -15 -10 -5 0 5 10 15 20 




distance / h 



Fig. 7. Distance travelled by elements in subsequent passes of Shellsort nsing the se- 
qnence (1, 4, 10, 23, 57, 132, 301, 701, 1750) for sorting 4000 elements 
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a few hk to the left or to the right, and then their distance is d = hk+i + ahk- 
If d turns out to be a multiple of hk-i, then they are redundantly compared 
again in subsequent /ife_i-sorting. 

Unfortunately, the interdependence between the quality of a sequence and 
the solutions of equations hk+i = aghk + . . . + aihk-i becomes more and more 
obscure as I grows. However, there is some evidence that in good sequences 
the norm of the shortest vector-solution {gq, . . . , ai) for fixed I asymptotically 
grows as we move on to greater increments. It seems to exclude the possibility 
that the optimal uniform sequence can be defined by a linear recurrence with 
constant coefficients or by interleaving such sequences. See Fig. 8 for a plot 
of average number of comparisons made by Shellsort using various increment 
sequences. Both Knuth’s and Hibbard’s sequences are relatively bad, because 
they are defined by simple linear recurrences. The Sedgewick’s sequence that 
consists of two interleaved sequences defined by linear recurrences, also becomes 
to deteriorate for TV > 10®. 




N 



Fig. 8. Average number of comparisons divided by Ig A^! for Shellsort using selected 
increment sequences 



7 Summary 

Using sequential analysis, the search for optimal increment sequences for Shell- 
sort was accelerated enough to find them for arrays up to several thousand 
elements. The obtained results show that comparisons rather than moves should 
be considered the dominant operation in Shellsort. It was also possible to state 
some heuristic rules about good sequences of increments. 
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However, the sequences obtained so far are too short to draw a reliable 
conclusion whether an appropriate sequence of increments can make Shellsort 
a 0{NlogN) sorting method on the average. Some hypotheses may be possible 
when the sequences are prolonged to sort arrays of about 10® elements. 
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Abstract. A cocolouring of a graph G is a partition of the vertex set of 
G such that each set of the partition is either a clique or an independent 
set in G. Some special cases of the minimum cocolouring problem are of 
particular interest. 

We provide polynomial-time algorithms to approximate a mimimum co- 
colouring on graphs, partially ordered sets and sequences. In particular, 
we obtain an efficient algorithm to approximate within a factor of 1.71 a 
minimum partition of a partially ordered set into chains and antichains, 
and a minimum partition of a sequence into increasing and decreasing 
subsequences. 



1 Introduction 



A cocolouring of a graph G is a partition of the vertices such that each set of the 
partition is either a clique or an independent set in G. The cochromatic number 
of G is the smallest cardinality of a cocolouring of G. The cochromatic num- 
ber was originally studied in ^3- Subsequent papers addressed various topics 
including the structure of critical graphs, bounds on the cochromatic numbers 
of graphs with certain properties (e.g., fixed number of vertices, bounded clique 
size, fixed genus) and algorithms for special graph classes (e.g., chordal graphs 
and cographs) [ 11416171811111212 ( 7 ) . 

In this paper, besides cocolouring of graphs in general we study cocolour- 
ings of permutation graphs, comparability graphs and cocomparability graphs. 
The cocolouring problem on permutation graphs is equivalent to the cocolouring 
problem on repetition-free sequences of integers (one may assume a permutation 
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of the first n integers) which has the following motivation: if one has to sort 
such a sequence, it is desirable to have a partition into a small number of sets 
of already sorted elements, i.e., subsequences which are either increasing or de- 
creasing. Now the minimum number of monotone subsequences partitioning the 
original sequence is exactly the cochromatic number of the permutation graph 
corresponding to the sequence. This problem was studied in EE2. 

Wagner showed that the problem “Given a sequence and an integer k, decide 
whether the sequence can be partitioned into at most k monotone (increasing or 
decreasing) subsequences” is NP-complete jSQ. In our paper we provide a first 
constant-factor approximation algorithm for cocolouring sequences. More pre- 
cisely our algorithm approximates a minimum cocolouring of a sequence within 
a factor of 1.71. 

In fact we derive our 1.71-approximation algorithm for the minimum co- 
colouring problem on comparability (or cocomparability) graphs. This problem 
is equivalent to the cocolouring problem on partially ordered sets, i.e. the prob- 
lem to partition a partially ordered set P into a minimum number of subsets 
each being a chain or an antichain of P. Since every permutation graph is a 
comparability graph, our algorithm can also be used to approximate within a 
factor of 1.71 a minimum cocolouring of a permutation graph, and a minimum 
partition of a sequence of integers into increasing and decreasing subsequences. 

We also present a greedy algorithm to approximate a minimum cocolouring 
of perfect graphs within a factor of log n. Finally we show the following hardness 
result for the minimum cocolouring problem on graphs in general: for every e > 0, 
no polynomial-time algorithm approximates a minimum cocolouring within a 
factor of unless NP ^ ZPP. 



2 Definitions 

We denote hy G = (V,E) a, finite undirected and simple graph with n vertices 
and m edges. For every W C V, the subgraph of G = (P, E) induced by W is 
denoted by G[W]. For simplicity we denote the graph G\V \ A] by G — A. 

A clique G of a graph G = {V, E) is a subset of V such that all the vertices of 
G are pairwise adjacent. A subset of vertices / C P is independent if no two of 
its elements are adjacent. We denote by w(G) the maximum number of vertices 
in a clique of G and by a{G) the maximum number of vertices in an independent 
set of G. 

An r-colouring of a graph G = (P, E) is a partition {/i, J 2 , . . . , Jr} of P such 
that for each 1 < j < r, Ij is an independent set. The chromatic number y(G) 
is the minimum size of such a partition and k(G) = x(G*) is the minimum size 
of a partition {Gi,G 2 ,... ,Gs| of the vertices of G into cliques. Analogously, 
a cocolouring of G is a partition {Ji, J 2 , • ■ ■ , Jn Gi, G 2 , . . . , G^j of P such that 
each Jj, 1 < j < r, is an independent set and each Gj, 1 < j < s, is a clique. 
The smallest cardinality of a cocolouring of G is the cochromatic number z{G). 
Therefore, z(G) < min{x(G), k(G)|. 
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A graph G = (V,E) is perfect if x{G\W\) = io{G\W]) for every W C V. Per- 
fect graphs and classes of perfect graphs play an important role in graph theory 
and algorithmic graph theory. The following well-known classes of perfect graphs 
will be studied in the sequel: comparability, cocomparability and permutation 
graphs. For all information on these graph classes and their properties not given 
in our paper we refer to RO . 

Let tF — {Fi,F 2 , . . . , Fs} and T' = {F{, F 2 , . . . , F/} be two set families of 
subsets of a ground set U. We denote by IJF the set Ui=i ^i- We denote by 
Fi o T 2 the set family {Fi, F 2 , . . . , F«, F{ , F^, . . . , F'}. 

3 Partially Ordered Sets and Comparability Graphs 

Let P = {V{P),~<) be a finite partially ordered set, i.e. ^ is a reflexive, anti- 
symmetric and transitive relation on the finite ground set V{P). Two elements 
a,b of P are comparable if a ^ 6 or 6 ^ a. Now a subset G Q V (F) is a chain 
of P if every two elements of P are comparable, and a subset A C V (F) is an 
antichain of F if no two elements of A are comparable. 

An orientation FI = (V,D) of an undirected graph G = {V,F) assigns one 
of the two possible directions to each edge e G F. The orientation is transitive 
if (a, 6) G D and (6, c) G D implies (a,c) G D. A graph G = {V,F) is a compa- 
rability graph if there is a transitive orientation FI = (V, D) of G. A graph is a 
cocomparability graph if its complement is a comparability graph. 

Consider the following well-known relation between partially ordered sets 
and comparability graphs. Let F = {V{P),~<) be a partially ordered set. We 
define an undirected graph with vertex set V (F) and an edge between a and b 
iff a and b are comparable. Then this graph is a comparability graph, its cliques 
correspond to chains in F and its independent sets correspond to antichains in F. 
On the other hand, suppose G = (V, F) is a comparability graph, and let FI be 
a transitive orientation of graph G. Since FI is an acyclic and transitive directed 
graph it induces a partially ordered set with ground set V where u < w {u^ w) 
iff there is a directed path from rt to w in H . Now every chain in the partially 
ordered set corresponds to a directed path in H which corresponds to a clique 
in G due to transitivity. Furthermore every antichain in the partially ordered 
set corresponds to an independent set in G. Thus the well known Dilworth 
theorem saying that the maximum cardinality of an antichain in F is equal 
to the minimum number of chains in a chain partition of V{P) implies that 
comparability (and cocomparability) graphs are perfect. 

More important for our paper, a cocolouring of a comparability (or cocom- 
parability) graph G corresponds to a partition of a partially ordered set into 
chains and antichains. Now we study cocolourings of comparability graphs. 

A maximum k-colouring Xk is a family of k independent subsets of a graph 
G covering a maximum number of vertices. Let cxk{G) denote the size of the 
maximum ^-colouring, i.e. the number of vertices in a maximum fc-chromatic 
subgraph of G. A maximum h-covering Ch is a family of h cliques of G cover- 
ing a maximum number of vertices. We denote by Kh{G) the maximum size of 
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an /i-covering of G, i.e. the number of vertices in a maximum subgraph of G 
partitionable into h cliques. 

Our approximation algorithm is based on the following results by Frank uni 
which can be seen as algorithmic proofs of Greene and Kleitman ’s |14I15| gen- 
eralizations of Dilworth’s theorem. 

Theorem 1 (jlOj). There is an 0(nm) time algorithm which computes for any 
given comparability graph G = (V,E) simultaneously 

(a) for all integers k with 1 < k < x(G) a maximum k-colouring Ik, and 

(b) for all integers h with 1 < h < k{G) a maximum h-covering Ch- 

The essential part of the algorithm is a minimum-cost flow algorithm on a net- 
work associated to G (via a partially ordered set P having comparability graph 
G). 

We shall also need the procedure SQRTPartition which is based on a result 
by Erdos et al. [7] (see also Brandstadt et al. j2|). 



SQRTPartition 

Input: perfect graph G = (V,E) with n < vertices, k>2. 

Output: cocolouring Z oi G 

- Z:= it)] U :=V] 

— while G 7 ^ 0 do 

begin 

• if X{G[U]) < k-\-l then compute a /c-colouring 
I={h,l 2 ,... ,h}oiG[U]] Z:=ZoI 

else choose a clique G of size at least k -\- 1 
and add G to Z; 

• U :=U -\JZ] k:=k-l. 
end 



Lemma 1 (fjlj). For every perfect graph G = (V,E) with n < > 

2, procedure SQRTPartition outputs a cocolouring of size at most k. Thus 
z{G) < [-\/2n -I- 1/4 — 1/2J for every perfect graph. 

Proof. For the sake of completeness we provide the simple proof by induction 
on k. For k = 2 the theorem is true. Suppose that theorem is true for k > 2. 

Let G be a perfect graph with n < O+iKfc+2) vertices. If x(G) < k-\-2 then 
the procedure outputs a cocolouring of size at most fc -|- 1. 

If x(G) > k -\- 2 then the procedure chooses a clique G of G such that 
|G| > fc -I- 2 which exists by the perfectness of G. The procedure removes all 
vertices of G from G, thus the number of remaining vertices in G — G is less 
than . By induction hypothesis G — G has a cocolouring of size at most fc 

and thus the theorem is true for fc -|- 1. □ 
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Lemma 2. Procedure SQRTPartition has polynomial running time on perfect 
graphs and its running time on eomparability graphs is 0{y/nm). 

Proof. The running time of SQRTPartition depends on the best known run- 
ning time of an algorithm to solve the problems minimum colouring and maxi- 
mum clique on our special classes of graphs. 

The best known running time is polynomial on perfect graphs m and linear 
on comparability graphs HS|. □ 

Now we are in the position to describe our algorithm to approximate a minimum 
cocolouring on comparability graphs. 



Approx Cocolouring 

Input; comparability graph G = {V,E) 

Output: cocolouring Z oi G 

— Compute a ^-colouring Ik = {I\, I 2 , ■ ■ ■ ,Ik} and an h-covering Ch = 
{Cl, C 2 , . . . , Gh} of G such that the sum fc -|- Hs minimum subject 
to the condition au{G) + Kh{G) > n; 

- Z' :={/i,/2,... ,h,Ci\[jIk,C2\\JIk,--- ,Ch\\JIk}; 

— Compute a cocoluring Z” of the graph G — IJ Z' by calling SQRT- 
Partition; 

- Z :=Z'oZ". 



Theorem 2. The 0{nm) time algorithm Approx Cocolouring approximates 
a minimum eoeolouring of a eomparability graph within a faetor of 1.71. 

Proof. Let Ik = |/i, ^ 2 , • ■ • , Ik} and Ch = jCi, C 2 , . . . , Gh} be the sets produced 
at the first step of the algorithm. Then by the choice of k and I as well as Ik 
and Ch we have that k + h < z(G). 

The number of vertices in Z' is at least 

> n — kh 

since Ik is a family of independent sets and Ch is a family of cliques, implying 
\\J{Ik)(l{\JCh)\<kh. 

Therefore, the graph G — IJ Z' has at most kh vertices and by Lemma ^ 
procedure SQRTPartition computes a cocolouring of G — IJ Z' having size at 
most \/2kh. Consequently, Approx Cocolouring computes a cocolouring Z 
of G of size at most 

k+h+ <{k+ h){l -k ^) < (1 -k ^)z{G) < 1.71 • z{G). 

v2 v2 



The time bound follows from Theorem Q and Lemma |2l 



□ 
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Corollary 1. The algorithm Approx Cocolouring can also be used to ap- 
proximate within a factor of 1.71 

(a) a minimum cocolouring of a partially ordered set, 

(b) a minimum partition of a (repetition- free) sequence (of integers) into in- 
creasing and decreasing subsequences, 

(c) a minimum cocolouring of a permutation graph, and 

(d) a minimum cocolouring of a cocomparability graph. 

4 Perfect Graphs 

We consider the following greedy algorithm for minimum cocolouring on graphs 



Greedy Cocolouring 
Input: graph G = {V,E) 

Output: cocolouring ^ of G 

- Z := 0; 

- C7 := I/; 

- while [/ yf 0 do 

begin 

• Compute a maximum independent set Ijj and a maximum clique 
Cu of G[C/]; 

• Choose X to be lu or Cjj such that lAj = max(|/[/|, \ Cu\) and 
add X to Z-, 

• U :=U -X. 

end 



Theorem 3. The Greedy Cocolouring algorithm approximates a minimum 
cocolouring of a perfect graph within a factor ofhin. 

Proof. To obtain the approximation ratio of the algorithm let us consider a 
hypergraph H = (V,Eh), where the vertex set of H is the vertex set V of the 
input graph G and E^ is the set of all independent sets and cliques in G, i.e. 
every hyperedge of 71 is either an independent set or a clique in G. 

Any minimum cocolouring on G is equivalent to a minimum set cover of 
TL and vice versa. Moreover Greedy cocolouring can be seen as the greedy 
algorithm for the minimum set cover problem on input 71 (the only difference 
is that Greedy cocolouring won’t inspect all hyperedges of 7L). It is well 
known that the greedy algorithm for the minimum set cover problem 

is an Inn-approximation algorithm. □ 

By a well-known result of Grotschel et al. m a maximum independent set 
and a maximmum clique can be computed by a polynomial-time algorithm on 
perfect graphs. 
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Corollary 2. The Greedy Cocolouring algorithm is a polynomial-time al- 
gorithm to approximate a minimum eoeolouring within a faetor of In n on each 
graph class Q for which there are polynomial-time algorithms to compute a max- 
imum clique and a maximum independent set. In particular this is the case for 
perfect graphs. 

5 Hardness of Approximation 

The following lemma allows us to obtain a first hardness result concerning the 
approximation of minimum cocolouring. 

Lemma 3. m x{G) = z(nG), where nG is the disjoint union of n copies of 

G. 



We shall rely on a well-known result on the hardness of approximating the 
chromatic number of a graph. 

Theorem 4. It is hard to approximate the chromatic number to within 
for any e > 0, assuming NP ^ ZPP. (Here, ZPP denotes the class of languages 
decidable by a random expected polynomial-time algorithm that makes no errors.) 

Combining Lemma 0 with Theorem 0 we obtain the following 

Theorem 5. It is hard to approximate the cochromatic number to within n^G-f- 
for any e > 0, assuming NP ^ ZPP. 

Proof. Let G be any input to the problem minimum colouring. Then G' = nG is 
our input to the problem minimum cocolouring. By Lemma 0 we have x(G) = 
z{nG). Since the number of vertices of G' is the square of the number of vertices 
of G, by Theorem 0 we obtain the theorem. □ 

6 Concluding Remarks 

We leave many questions unanswered, a few of them are: 

1. The problem of finding a minimum partition of a sequence into monotone 
subsequences is NP-hard. We provide a 1.71-approximation algorithm for 
this problem. A natural question is if there exists a PTAS for this problem? 

2. We have proved that Greedy Cocolouring is a Inn-approximation al- 
gorithm for perfect graphs. Are there nontrivial classes of perfect graphs 
for which Greedy Cocolouring approximates the cochromatic number 
within a constant factor? 

3. What is the computational complexity of computing a maximum /c-colouring 
and a maximum /i-covering for perfect graphs? A polynomial time algo- 
rithm computing these parameters for perfect graphs will imply that our 
1.71-approximation algorithm for a minimum cocolouring on comparability 
graphs is also a polynomial time algorithm on perfect graphs. 
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Abstract. We consider the problem of drawing a graph where edges 
are represented by smooth curves between the associated nodes. 
Previously curved edges were drawn as splines defined by carefully 
calculated control points. We present a completely different approach 
where finding an edge is reduced to solving a differential equation. This 
approach allows to represent the graph drawing aesthetics directly, 
even the most complex ones denoting the dependencies among the paths. 

Keywords. Graph drawing, edge placement, curved edges. 



1 Introduction 

Edge placement is one of the fundamental problems in graph drawing 0. There 
are two traditional strategies for edge placement: edges are placed together with 
nodes, or nodes are laid out first and then edges are routed. Placing edges to- 
gether with nodes allows to create drawings with proven quality |4I9| . However, 
if a more complex line style than a straight-line segment is used, it gets very 
hard to describe the edge influence on the node placement. The independent 
edge placement Eig has the advantage of exploiting complex edge routing us- 
ing general-purpose node layout algorithms. This is also the only strategy in 
interactive graph layout where the user specifies the node positions. 

In this paper we follow the second strategy and assume that nodes are already 
positioned by the user or some layout algorithm and we have to route the edges. 

There are several graphical representations of a graph m Nodes are basi- 
cally represented by two-dimensional symbols, most commonly by upright rect- 
angular boxes or circles. In this paper we will use only rectangular boxes. We 
also assume that nodes do not overlap, so there is some space for edge routing. 

The visual appearance of the drawing is mostly influenced by the edge rep- 
resentation. Well-known edge drawing standards are orthogonal, straight-line, 
polyline and curved m Here we focus on curved edges, where the edge is 
represented by a smooth curve between the associated nodes. 

The common approach for curved edge routing is finding a piecewise linear 
path which does not cross node boxes and then smoothing it using a spline. Such 
approach is natural in drawing layered graphs. In layered drawings edges are 
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replaced with polyline paths having bendpoints on every level. This construction 
provides a useful framework for smoothing the polyline paths with splines. 

Such approach is used in the graph drawing program dot |3 where virtual 
node boxes are used to describe the free space in the hierarchical drawing where 
the path can go. At first the calculated polyline path is smoothed with a spline. 
Then if the spline goes outside the virtual node boxes, control points are adjusted 
or the spline is subdivided. In VCG PH Bezier spline is fitted using adaptive 
subdivision, adjusting control points if the bounding triangle of a Bezier spline 
segment intersects a node or a path. 

Dobkin p] describes a more general method applicable to any graph. Again 
the polyline path is calculated first. Then a smooth spline is fitted, which does 
not intersect any obstacle. This method produces nice drawings, but only for 
each path separately. Crossings of several paths are not taken into account. 

Obtaining a smooth path among obstacles is a well-studied problem in robot 
motion planning. Finding a shortest constrained curvature path of a robot in the 
workspace with obstacles is explored in P|. It is shown that such path can be 
constructed from straight-line segments and circle arcs with a constant radius 
corresponding to the maximal allowed curvature. 

Latombe pni suggests a potential field method for path planning. He sim- 
ulates the movement of a particle in the electrostatic field when the particle is 
attracted to the target and repelled from the obstacles. This seems very interest- 
ing approach, but it deals with such physical properties as speed and acceleration 
and application of this method to graph drawing does not seem to be straight- 
forward. 

Sethian [E| develops a general theory called level set methods and shows 
how to apply it to produce a shortest path of a robot within the environment of 
constraints. He reduces the problem to a differential equation and applies the fast 
marching method to solve it. We have found another application of this theory. 
Using Sethian’s approach it is easy to find curves according to specific rules. In 
our case the rules are constructed in a special way to reflect the aesthetics of 
graph drawing and allows finding the path directly, avoiding usage of splines. 

The rest of the paper is organized as follows. In Section 2 we define the 
aesthetic edge routing problem by choosing a cost function and then finding 
the edge drawing according to this cost function. Section 3 gives details how 
to construct the cost function to represent the curved edge drawing aesthetics. 
Section 4 presents an algorithm to route a path according to the given cost 
function. In Section 5 we give the analysis of the results and sketch the possible 
improvements. 



2 Problem Definition 

At first we have to find out how the well-routed edges should look like. There 
are a number of commonly accepted aesthetics criteria for graph drawing |HE1 
E]. We consider the following for curved edge drawings: 
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(1) edges are short, 

(2) edges are well separated from node boxes and other edges, 

(3) edges do not turn sharply, 

(4) edge crossing number is small, 

(5) edges cross in wide angles. 

An intuitive way how to represent these aesthetics is to assign some cost 
h{x,y) to every point (x,y) on the plane expressing how good it is for the edge 
to go through this point. Then the edge is searched as a path between its node 
boxes that has the smallest total cost. For now we can assume that the path 
is joining the node centers (xo,yo) and (a;i,j/i). Formally each individual path 
can be expressed as a curve L with fixed endpoints (xo,yo) and (xi,yi) which 
minimizes the integral 

J h{x,y)dL. (2.1) 

This definition corresponds only to one path, but the aesthetics criteria express 
also relations between several edges. To overcome this limitation we construct 
the paths one by one but incorporating the previously routed paths into the cost 
function. 

At the beginning we have only nodes, so the cost function is calculated for 
them. When a new path is routed the cost function is updated to reflect the 
current configuration. Since we are adding paths one by one the drawing is 
dependent on the order in which they are added. To reduce the sensitivity to 
the order, paths are routed iteratively e.g. after initial routing they are rerouted 
several times by taking one path at a time, deleting it and routing it again. This 
method is summarized in the algorithm routeEdges. 

algorithm routeEdges 

Build the initial cost function corresponding to nodes 
for each path P 
Route P 

Update the cost function with P 
endf or 

for iter = 1 to MAX_1TER 
for each path P 

Delete P from the cost function 
Route P 

Update the cost function with P 
endf or 
endf or 
end 

Two parts of the algorithm need to be explained: how to build the cost 
function, and how to find the path corresponding to the defined cost function. 
These problems are discussed in the following sections. 
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3 Finding the Cost Function 

Let us look how to construct the cost function h{x, y) to incorporate the above 
mentioned aesthetics. We express h{x,y) as a weighted sum of functions, where 
each function expresses the influence of one drawing element (node or path) . We 
add also some constant ci, which expresses the cost of the length of the path. 
The general form of the function is 

h{x,y) = Cl + C 2 '^h{x,y) + p^{x , y) , (3.2) 

i i 

where bi are the functions for nodes and pi are the functions for paths. For nodes 
we want the function to be infinite inside the nodes to prohibit the paths from 
crossing them, and decreasing with the distance from nodes to keep the paths 
some distance apart from them. For paths we want the function to be of some 
finite magnitude near the path and decreasing with the distance from the path. 
Constants C 2 and C 3 are weighting factors for the influence of the nodes and paths 
respectively. We found that setting ci = 1, C 2 = 1.5 and C 3 = 5 is adequate. 

Denoting the distance from a point {x, y) to the drawing element by d{x, y) 
our choice for the function for nodes was 

b(a:,y) = f ' “-e node 

(00 if [x, y) IS inside the node 

In practice the value of bi inside the node is set to a large enough constant for 
paths not to cross the node. This simplifies the algorithm and allows us to route 
the paths between the node centers. 

For paths two kinds of functions were considered: 

P{x, y) = max(l - 0), (3.4) 

a 

d{x,y)^ 

p{x,y) = e (3.5) 

where a controls the desirable separation between paths. In the Fig. 1 we can 
see the drawings of a graph using different cost functions for the paths. Both 
functions give good results, but the second one overall seemed to be a little 
better, so further we will work with it. For the further discussion we must choose 
a good representation of the cost function. We chose the simplest approach to 
represent the cost function by sampling values on the orthogonal grid. Other 
choices are also possible, for example triangular meshes, but that seemed to be 
more complex. 

Let us look how to calculate the cost function on the grid quickly. We have 
to notice that the functions bi and Pi have some influence only in a small neigh- 
borhood near the corresponding object. In other points we can set the function 
to zero without affecting the resulting drawing too much. We need to And all 
the points where the function is large enough. 
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one (6) 



For node boxes this is easy since all such points are contained in a rectangle 
around the node. Size of this rectangle can be easily calculated from the definition 
of bi. 

For paths the situation is more complex, because the path is approximated 
with many short line segments and determination of the distance for the point 
to the path requires testing all the path segments. We use the property that our 
functions are monotonically decreasing with the distance from the object. 

The algorithmcost-function-for-path calculates the function by starting at 
the path and expanding the zone if the function is greater than some threshold. 
For this purpose a priority queue ordered by the function p value is used. For 
each point in the queue the nearest path segment is kept and the function is 
calculated depending only from this one segment. The queue is initialized with 
the midpoints of all the segments. In the loop of the algorithm we get the point 
with the highest function value from the queue and calculate the function for 
its neighbors. If the function is larger than the given threshold for them, we 
put them into the queue. Since the priority queue can be implemented that one 
operation takes O(logn) time 0, it can be easily seen that this algorithm finds 
the correct function in time 0(n log n), where n is the number of the affected 
points. 

algorithm cost_function_f or_path 
for each segment S of the path 
M = midpoint of S 
d = distaince from M to S 

Calculate the function value h depending on the distance d 
Put < M, S,h > into the priority queue 
endf or 

while the queue is not empty 

Get <M,S,h> with the largest h value from the queue 

if h < threshold then terminate 

for each of the four neighbour points N of M 
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if the function has not been calculated for N then 
d = distcince from N to S 

Calculate the function value hi depending on d 
Put < N, S, hi > into the priority queue 
endif 
endf or 
endwhile 
end 

4 Finding the Path 

For the given cost function our task is to find the path between two given node 
centers. The simplest approach would be to search for the shortest path in the 
grid neighbor graph. But this does not give acceptable solution because of the 
orthogonal structure of the graph. We need a continuous, direction independent 
solution approximated on the grid. 

We use a method essentially the same as the Dijkstra’s shortest path search- 
ing ^114. At first the distance map is calculated for all points and then the path 
is determined by tracing backwards from the destination point. But there are 
differences in the distance map calculation and back tracing. In the Fig. 2 we 
can see the cost function and the distance map of the cost function shown in 
Fig. 5. The starting point was chosen in the middle of the map. 





Fig. 2. The cost function (left) and the corresponding distance map {right) plotted 
in three dimensions 



4.1 Calculation of the Distance Map 

The distance map is a function f{x,y) which denotes the distance from the 
starting point (xo,yo) in the environment of varying costs. It is easy to imagine 
the distance map as a pit-like function in three dimensions, where the height of 
the pit surface is the value of the distance map at the given point. It is very 
interesting that this pit is the same as develops when the sand of non-uniform 
structure run out from the box with a little hole at the bottom. If the sand can 
hold at the maximum slope h{x, y) then it is exactly our distance map. Moreover 
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this construction has a prototype in the mining industry when an open pit must 
satisfy the given slope requirements m The distance map f{x, y) satisfies the 
following equation 

\\Vf\\=Hx,y). (4.6) 

This means that the slope of / is our cost function h. After adding the starting 
condition and writing the gradient explicitly the problem can be stated as follows: 




(4.7) 



This equation can be solved with numerical methods by approximating deriva- 
tives with differences. For example the central differences can be used: 



D 

D 



X 

y 



f(x+l,y)-f(x-l,y) 

f{x,y+l}-f{x,y-l) 

2 ■ ’ 



and the problem is transformed to 



(Dl + Dl = h{x,y)^ 

1 f{xo,yo) = 0 



(4.8) 



(4.9) 



The solution can be obtained by iterative solving the equation at each grid point. 
But this is very slow because many iterations are needed. 

Sethian proposes a faster method. He observes that the solution can be 
constructed by propagating the front like in Dijkstra’s shortest path algorithm. 
He uses one-side differences instead of central ones selecting the direction, which 
the front came from. As a result one pass over the grid is sufficient. According 
to Sethian the algorithm works as follows: 



algorithm distanceMap 
for all (x,y) f{x,y) = oo 
f{xo,yo) = 0 

Put (xo,yo) into the priority queue 
while the queue is not empty 

Get and remove the point (x, y) with the smallest / value 
from the queue 

if {x,y) = {xi,yi) then terminate 

for each of the four neighbour points (rt,u) of {x,y) 
if f{u,v) = oo then 

compute value of f(u,v) by selecting the largest solution 
t of the quadratic equation: 

(max(z — min(/(M — 1, v), f{u + 1, w)), 0))^-|- 
(max(z — min(/(u, v — 1), f{u, v + 1)), 0))^ = h{u, u)^ 

Put (u,v) into the priority queue, 
endif 
endf or 
endwhile 



end 
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The algorithm terminates when the priority queue is empty or the destination 
point is reached. In m Sethian proves the correctness of the algorithm as well 
as the time complexity of 0(n log n), where n is the number of affected grid 
points. 



4.2 Back Tracing 



Now when we have the distance map calculated we can search for the path. 
The simplest way is to trace back from the current point to its lowest neighbor, 
starting at the other endpoint of the path. This approach does not find the best 
path because of the two dominant directions imposed by the grid. 

Another way is to walk in the direction of the gradient with a fixed step. 
This approach is rather complicated because a sub-cell accuracy is needed. 

We have implemented a different approach (algorithm findPath) similar to 
the error diffusion paradigm in computer graphics ( 7 ). We walk gridpoint by 
gridpoint like in the first approach, but correcting the direction pointed by the 
gradient. At the current gridpoint we calculate the gradient, normalize it and 
walk to the neighboring gridpoint, which is the closest to the gradient. Then the 
error is calculated by subtracting the gradient vector from the direction moved. 
This error is added to the gradient for the next point. 



algorithm findPath 
error = 0 

P = (xi,yi) 
while P ^ {xo,yo) 
add P to the path 

calculate the gradient G at the point P 

G=G / |G| 

G = G + error 
V = P+G 

P = the closest gridpoint to V 
error = V — P 
endwhile 
end 



The walk always terminates in the starting point because from its construc- 
tion f(x,y) contains only one minima at the starting point. This method gives 
us the exact curve approximated on the grid as a chain of gridpoints. In the Fig. 
3 we can see the paths generated by the lowest neighbor walk and the algorithm 
findPath using unity cost function. It can be seen that the latter produces a per- 
fectly straight line on the grid. Fig. 4 shows the path corresponding to a realistic 
cost function. 
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Fig. 3. Paths generated by the 
lowest neighbor walk (thin) and 
the algorithm findPath (thick) 
using the unity cost function 




Fig. 4. A distance map and a path cor- 
responding to it 



To reduce the memory required for the path representation the path is ap- 
proximated by a polyline. A simple algorithm is used where the sequence of 
gridpoints is approximated by a line segment if all points approximately fall on 
a common line. This algorithm has a quadratic worst-case complexity but in 
practice it is very fast in comparison with other parts of the program. 



5 Implementation and Results 

We have implemented the curved path router as a separate program, which takes 
graph description as an input and routes all the paths. The input must contain 
node positions and sizes and the edge connectivity information. 

In Fig. 5 to Fig. 8 we can see some drawings produced by the proposed 
edge placement algorithm. The drawings of the edges appear natural, easily 
perceptible for the user. The drawing conforms to all the mentioned aesthetics. 
The relations between several paths are represented correctly, edge separation 
is good and edges cross in wide angles, what is hard to achieve with previously 
known approaches lanini- 

The desired behavior is easily customizable by adjusting the weights for nodes 
or paths, or changing the cost function to incorporate new aesthetics. 

Sometimes when the node is large the paths coming from node center leads 
to uneven distribution of the edges along the node border. To remedy this we 
could alternatively modify the distance map calculation by starting at all the 
border of the start node and ending when the border of the destination node is 
reached. 

Our implementation of the proposed edge routing approach is not fast. For 
example it took 12 seconds to route the edges of the graph shown in the Fig. 5 
on a Pentium III 800 MHz computer. The most expensive part is the distance 
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map calculation. We have observed that its time complexity of routing one edge 
is approximately 0{L^ logL), where L is the length of the path in grid units. 

The speed of the program was not the goal of this research, so there is much 
room for optimizations. The grid step has the greatest impact on the running 
time, so it is crucial to choose the right value to compromise the quality with 
the running time. We chose the grid size equal to the pixel size on the screen 
for the best visual quality. However we believe that more sophisticated choices 
dependant from the characteristics of the graph are possible. 

Another improvement could be propagating the front only in the most 
promising directions. However our first experiments using goal directed search 
P failed, because in Sethian’s approach the propagation must be carried out 
in a strict distance ordering. 




Fig. 6. 
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The number of iterations also plays a significant role. In the most cases 3 to 

5 iterations are sufficient, so in our experiments we always make 5 iterations. 

Some adaptive criteria could do better. The initial ordering of the paths is also 

important. May be it would be worth routing the shortest paths first. Thus we 

might reduce the iteration count and also the crossing number could be smaller. 
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Abstract. An exact pattern matching problem is to find all occurrences 
of a pattern p in a text t. We say that the pattern matching algorithm is 
optimal if its running time is linear in the sizes of t and p, i.e. 0{t + p). 
Perhaps one of the most interesting settings of the pattern matching 
problem is when one has to design an efficient algorithm with a help of 
small extra space. In this paper we explore this setting to the extreme. We 
use an additional assumption that the text t is available only in a com- 
pressed form, represented by a straight-line program. The compression 
methods based on efficient construction of straight-line programs are as 
competitive as the compression standards, inclnding Lempel-Ziv’s com- 
pression scheme and recently intensively studied compression via block 
sorting, due to Burrows and Wheeler. Our main result consists in solving 
compressed string matching problem in optimal linear time when only 
a constant size of extra space is available. We also discuss an efficient 
implementation of a version our algorithm showing that the new concept 
may have also interesting real applications. 



1 Introduction 

The importance of data compression methods and their use on everyday basis 
have been steadily growing over the past few decades. Especially the last ten 
years - due to unprecedented popularity of the Internet and the World Wide 
Web - bring enormous (still exponential) increase of electronic data available. 
This phenomenon creates a new challenge in the fields of text algorithms and 
data compression - a need for new tools that allow to store data in a succinct 
form that preserves fast and easy access. The string/ pattern matching problem, 
where one is interested in finding one/all appearances of a pattern p in the text 
t (usually much larger), is a key problem in efficient text searching. The running 
time of a string matching algorithm is very often expressed in terms of a number 
of comparisons between the pattern and the text symbols. We say that the run- 
ning time of a string matching is optimal if it’s linear in the sizes of the text and 
the pattern. A number of optimal 0(t -I- p)-tim^ string matching algorithms has 

^ We allow names of strings to be used in the context of asymptotic notation, e.g. 
0{w) stands for functions that are linear in the length of string w, however if w 
stands alone we use a standard notation |ui| to denote its length. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 138-^^ 2001. 

@ Springer- Verlag Berlin Heidelberg 2001 
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been introduced both in the theoretical as well as in a practical setting, 0 . Many 
aspects of efficient textual search has been investigated, but perhaps one of the 
most interesting settings is when one has to design an efficient algorithm with a 
help of no (or very small) extra space. Galil and Seiferas observed in that the 
string matching problem can be solved in linear time while using only additional 
memory of a constant size. Later work in this field was focused on determining 
an exact complexity of constant space string matching. The running time of a 
constant space string matching was first estimated by Crochemore and Perrin 
1^. Their searching procedure performs at most 2\t\ comparisons for linearly or- 
dered alphabets. An alternative algorithm designed for general alphabets with 
complexity (2 -|- e)|t| was later presented by Gasieniec, Plandowski and Rytter 
in P]. The first small space algorithm which beats the bound of 2|t| comparisons 
was presented by Breslauer in |3j. He designed a variant of Grochemore and 
Perrin algorithm that uses at most (| -|- e)|t| comparisons while preserving con- 
stant additional space. In nm Gasieniec et. al. presented very simple constant 
space string matching algorithm with searching stage performing at most 2n 
comparisons and more complicated one requiring at most (l-|-e)|t| comparisons. 
In this paper we explore constant space string matching to the extreme. We use 
an additional assumption that the text t is available only in compressed form. 
Under this assumption we deliver constant space string matching algorithm that 
works in time 0{t). We didn’t intend to study an exact constant of our linear 
solution. Our intention was rather to show that such a linear algorithm exists. 

The problem of searching in compressed files has been studied recently quite 
intensively. The focus was on the searching time and most of the algorithms 
used at least linear additional space. Several efficient solutions were designed 
for LZ/LZW-compressed files, see e.g. [Util I Ij . There was some interest in com- 
pressed pattern matching for straight-line programs too, however first solutions 
required linear memory in the size of a compressed file too, see e.g. [Itij . Only 
recently more space competitive algorithm that uses only linear (in the size of 
a dictionary) extra space was presented in [20] . In contrast, our algorithm re- 
quires only constant extra space and it is independent on the size of dictionary. 
Also very recently Amir et al. in |2j stated problem of the compressed pattern 
matching in small extra space. They propose time/space efficient pattern match- 
ing algorithm that works for run-length encoding. Their algorithm (designed for 
2d-pattern matching) works in linear time and requires only linear space in the 
size of a compressed pattern. Their result is possible due to the fact that run- 
length coding is extremely simple, but unfortunately it doesn’t give such a good 
compression as the other methods do. Another interesting aspect of the com- 
pressed search has been recently studied by Ferragina and Manzini, see 0. They 
have delivered a novel data structure for indexing and searching with a space 
occupancy being a (linear) function of the entropy of the underlying data sets. 

The rest of the paper is organised as follows. In section 0 we recall some 
basic string definitions as well as we expose more details on data compression 
via straight-line programs. In section 0 we show how the actual text search 
is performed. We first show how to traverse straight-line programs and then 
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how to combine that with an on-line efficient search. In section 0we discuss an 
implementation of our algorithm as well as the results of several experiments. 
We conclude (section O with several remarks on the subject. 

2 Preliminaries 

We start with some formal definitions. Let s =< s[l], s[2], .., s[m] > be a string 
over an alphabet S with lifl > 2. We shall frequently denote s by s[l,..,m]. 
An initial segment s[l, for i = is called a prefix of string s, and 

final segment s[j, .., m] is called a suffix of s. A period of s is a positive integer q 
such that for any i, 1 < i < m — g, s[f] = s[i -I- g]. The shortest period of string 
s is called the period of s and we will denote it by per{s). String s is called 
primitive iff per(s) = m. String s is called periodic if per(s) < otherwise it is 
called non-periodic. Assume that string s is non-periodic, but it has a periodic 
prefix 7T with the period q. A pair (a, 6) that breaks period g is a left most pair 
of positions in s, s.t. b — a = q and s[a] ^ s[6]. 

Example 1 String abracadabra has two periods: 7 and 10, and per (abracadabra) 
= 7. On the other hand, strings igor and leszek are primitive and their periods 
are 4 and 6, respectively. 

Given two strings built over alphabet A: pattern p = p[l,...,m] and 
text t = t[l,...,n\. We say that pattern p occurs at position i in text t if 
p[j] = p[i -|- g — 1], for all j = 1, .., m. A string matching problem is to find all 
occurrences of pattern p in text t. 

Definition. A straight-line program (SLP) i? is a sequence of assignment state- 
ments: 

Xi := expri; X 2 := expr 2 ] • • •; := expr„; 

where Xi are variables and expri are expressions of the form: expri is a symbol 
of a given alphabet A, or exprri = • Xj^ ■ for some 1 < I < k, ji < i, 

where A • Y denotes the concatenation of A and Y. 

In other words SLP is a simple context-free grammar with a set of pro- 
ductions of the form: Xi — )> Ag^ . . . Ag^, for some 1 < I < k, ji < L As in 
context-free grammars each SLP has a starting nonterminal that stands for a 
whole string generated by the SLP. 

Example 2 SLP for Fibonacci words: 

F(n) ^ F(n - 1) ■ F(n - 2) 

F(n - 1) ^ F(n - 2) • F(n - 3) 

F(3)^F(2)-F(1), F(l) = a, F(2) = b 
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F<6) 




F<5) F<4) 




F{4) F(3) F(3) F(2) 

AAAI 



F(3) F(2) F(2) F(l) F(2) F(l) b 

/\ 1 1 1 1 1 




Fig. 1. SLP for 6th Fibonacci word. F(6) is a starting nonterminal. 

2.1 Compression via Straight-Line Programs 

There are several compression methods based on construction of simple deter- 
ministic grammars, e.g. see The crucial idea used here is to find the smallest 
possible set of productions, a dictionary, that generates a source string. 

In this paper we adopt a recursive pairing scheme that generates 
relatively small set of productions. Assume that initially a source (to be 
compressed) string s G A*, where A is a source alphabet, and dictionary T> is 
empty. At any stage of the compression process alphabet A is extended to A 
and it is used to encode a current version S of the source string. We start each 
stage by finding the most frequent pair of neighbouring symbols (a, b) in S. 
We replace each occurrence of (a, b) by a new symbol a, we store production 
a — >■ (a, b) in dictionary V, and we extend alphabet A = AU {a}. We repeat this 
process as long as it makes sense, e.g. when it still leads to a better compression. 
When this process is finished both dictionary V and alphabet A are encoded 
(using e.g. arithmetic coding) to minimize the size of the compressed source 
string. The compressed representation of an input string is formed by a triplet 
{V,A,S), where V is the dictionary (i.e. set of SLP productions), A is the 
extended alphabet and 5 is a compressed string built over extended alphabet 
A. Note that cardinality of dictionary V is not greater than cardinality of 
extended alphabet A. 



Example 3. A compression of abbababbabbababbababbabbababbabba G {a, 5}*. 



string S 


new production 


alphabet A 


abbababbabbababbababbabbababbabba 




'S' 

e 

II 

II 


AbAAbAbAAbAAbAbAAbAba 


A ^ ab 


A = {a, b, A} 


BABBABABBABBa 


B Ab 


A = {a, b, A, B} 


CBCCBCBBa 


BA 


A = {a, b, A, B, C} 


DCDDBa 


D^CB 


A= {a,b,A,B,C,D} 



A compressed representation of string abbababbabbababbababbabbababbabba is a 
triplet (T>,A,S), where dictionary T> = {A — ?> ab,B — >■ Ab,C — ?> BA,D — ?> 
CB}, extended alphabet A = {a, b} U {A, B, C, D}, and compressed string S = 
DCDDBa. 
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Another compression method based on generation of SLPs has been recently 
introduced by Nevill-Manning and Witten, see m- Their algorithm traverses 
a source string from left to right introducing a new production as soon as a 
frequency of some pair passes a certain threshold. The advantage of the new 
approach is better compression ratio and faster compression time, though the 
final size of the dictionary is unpredictable. While in the recursive pairing method 
we could stop creation of new productions at any time, e.g. when we were happy 
with the achieved compression ratio or when further processing doesn’t lead to 
a better compression. Similar control in Nevill-Manning and Witten’s algorithm 
must be more complicated or perhaps even impossible. 

3 Space Efficient Compressed Text Searching 

As the input our searching algorithm gets a dictionary (sequence of productions) , 
compressed version of text t, and pattern p. As the output the algorithm reports 
all occurrences of pattern p in text t. Our solution is a combination of space 
efficient traversing of directed acyclic graphs (dags) and time efficient constant 
space string matching. We start with a short reminder of Schorr-Waite algo- 
rithm m and then we show how to use it in the context of time/space efficient 
searching in SLPs. 

3.1 Schorr-Waite Algorithm 

A Schorr-Waite algorithm uni was primarily used to traverse directed graphs in 
the presence of small extra memory (one bit per node of the graph). However 
for some certain classes of graphs like trees and directed acyclic graphs (dags) 
the traversal can be performed in place, i.e. with a help of a constant size extra 
memory (i.e. few additional registers). The concept of their algorithm is very 
simple and it is based on reversing directions of edges in the graph. We give 
here a simple illustration of Schorr-Waite algorithm on the example of directed 
binary trees, see Figure |3 




I 





D 



♦ 







D 



Fig. 2. Full round of Schorr-Waite algorithm 
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3.2 Traversing SLPs 

A straight-line program (SLP) can be represented as a dag G, where all nodes 
of out-degree > 0 stand for nonterminals (new symbols of extended alphabet A) 
and all nodes of out-degree 0 stand for terminals (symbols in original alphabet 
A). All productions are determined by nonterminal nodes and their outgoing 
edges. An SLP can be also expressed as a (usually much larger) rooted tree 
T with the root labeled by SLP starting nonterminal, see figure [D A string 
generated by the SLP can be formed by traversing rooted tree T in infix order 
and reporting content of consecutive leaves in T. 

The idea of Schorr- Waite traversing strategy is based on manipulation of 
pointers (dags edges). However in case of an SLP each pointer is represented 
as an element of some production, thus simple manipulation on pointers has to 
be replaced by more complicated manipulation on SLP productions. In what 
follows we show how such manipulation can be performed efficiently. 

Definition. An SLP configuration is a tuple K = (S,p), where S = (si, . . . , Sk) 
is a set of productions and p = (A, b) is an ordered pair of SLP symbols, where 
symbol X stands for current SLP starting nonterminal. 

Definition. An initial SLP configuration is a tuple Kq = {So,po), where Sq 
is an SLP that generate a source string and po is an ordered pair formed by 
the leftmost nonterminal symbol in the starting SLP production and a special 
symbol #. 

Definition. Let K = (S,p) and K' = {S',p') be two SLP configurations. Then 
configuration K' is reachable from configuration K, we write K ^ AT', iff one of 
the two conditions holds: 

— p = (Xi,b) and there is production Si in configuration K of the form 
Xi ^ A\ . . .Ak, then K' is obtained from K by changing production s'i 
to Ai — )> A 2 . . . Xk-i,b and p' to (Ai, A^), 

— p = (A, B) and there is no production in configuration K of the form 
A — >■ Ai . . . Afe, then S = S' and p' = (B, A). 




A2 A3 A4 Ak A3 A4 Ak X 



Fig. 3. Traversing by changing configurations 



144 L. Gasieniec and I. Potapov 



Example 4. Sequence of configurations in Figure 0 
{A ^ BC,B^ DE\A#) ^ {A ^ C#, B -)■ DE\BA) ^ 

{A C#, B EA\DB) ^ {A C#, B EA\BD) => 

{A C#, B AD\EB) ^ {A ^ C#, B AD\BE) => 

{A C#, B DE\AB) ^ (A ^ B DE\CA) ^ 

U ^ #S, B DE\AC) ^ {A ^ BC,B ^ DE\#A) ^ 

{A^ BC,B ^ DE\A#) 

The following lemma holds. 

Lemma 1. For any SLP that generates string of length n and each production 
is of size k there exists a unique sequence of configurations Kq, Ki, . . . , Kr, s.t. 

— Kq => Ki => . . . Kq, where Kq is initial configuration, where 

- r=(|^+2)-n-|^. 

Corollary 1. The number of reachable configurations r < 5 • n — 3, for any 
k>2. 

The algorithm traversing SLPs via configurations is as follows: 

(1) Procedure SLP-traverse 

(2) K — Kq] {set initial configuration} 

(3) repeat 

(4) if ( A is a non-terminal symbol ) then 

(5) replace in current configuration K production 

(6) Si = A ^ Ai . . . Ai~ by ^ ^ A 2 ■ ■ ■ X)^_iB 

(7) and pair p' = {A, B) by {Ai, A); 

(8) else {A is a leaf of the tree} 

(9) report nonterminal A and replace in configuration 

(10) pointer (A,B) by (B,A)] 

(11) until ((top of iF) = # ) 

Since the length of a sequence of reachable configurations Kq => Ki ^ 

Kr Kq is at most 5n — 3, each production can be accessed in time 0(1) (all 
productions are of the same size k and they are stored in alphabetic order), and 
a number of modifications associated with a change of a configuration is also 
0(1), the total running time of SLP-traverse is linear, i.e. 0(n). 

Theorem 1. We can traverse SLP-compressed text t reporting consecutive sym- 
bols in t with a help of constant extra space in time 0(t). 

Recall that compression by recursive pairing results in factorization of source 
string t into substrings ti,t 2 , ■■,ti, s.t. t = tit 2 -.ti where each ti, for i = l,..,l 
is encoded by some SLP. Since each ti can be traversed in time 0(<i), the total 
traversing time is bounded by 0{ti) = 0{t). 

We have just shown that we can traverse consecutive symbols of a compressed 
string in linear time. In the next section we will show how this traversal can be 
combined with a constant space string matching. 
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3.3 Constant Space String Matching 

We start with a short reminder of Knuth-Morris-Pratt (KMP) algorithm. The 
KMP algorithm inspects text symbols from left to right updating information 
about the longest currently recognised prefix of the pattern. If the next text 
symbol is the same as the next pattern symbol, the text symbol is consumed and 
currently recognised pattern prefix is extended. Otherwise (a mismatch occurred) 
the appropriate shift (of the pattern) is made and the size of recognised pattern 
prefix is decreased respectively with a help of a failure function, see e.g. US!. 

In our setting KMP algorithm has a straightforward implementation assum- 
ing that we can use 0(p) space to store the values of the failure function. However 
the task here is to design an algorithm that uses only a constant extra space. 
The main idea is to mimic the performance of KMP algorithm by efficient al- 
gorithmic implementation of the failure function. A similar concept of a use of 
approximate failure function was proposed earlier by G^sieniec et al. in m- 

The computation of a shift requires a use of a constant space string matching 
algorithm. We use here sequential sampling algorithm cni. However the sequen- 
tial sampling requires a preprocessing when one has to find the longest periodic 
prefix of the pattern including a pair (of symbols) that breaks this period (if 
any). Since our algorithm have no preprocessing at all, the computation of the 
longest periodic prefix has to be done on-the-fly. 

The concept of our algorithm is as follows. The algorithm works in stages. 
Each stage corresponds to an attempt to recognition of a pattern prefix of 
size 2^, for k = l,..,logm. We will use the following invariant. We enter some 
stage when a successful recognition of prefix tt^ is completed. We also assume 
that in the beginning of any stage we know what is the longest periodic prefix of 
TTfe, what is its period, and what is the leftmost pair of symbols that break this 
period. 

Any stage starts with a naive comparison of consecutive symbols in tt^+i (fol- 
lowing an occurrence of tt^) with corresponding symbols in text t. If all symbols 
in TTfe+i are compared with a success, we enter the next stage with a larger prefix 
to be found. Otherwise the mismatch /i (between the text and prefix symbols) is 
found and we do the appropriate shift updating information about current TTfc. 

In what follows we assume that pattern p is non-periodic to make the pre- 
sentation clearer. However the algorithm can be easily implemented for periodic 
patterns applying standard trick of looking for a sequence of consecutive occur- 
rences of its longest non-periodic prefix, see e.g. uni. 



(1) Procedure SM; 

(2) TTfc ttq; i <— 1; 

(3) if TTfc+i = t[i, ,.,i+ \nk+i \ - 1] then 

(4) if TTfc+i = p[l , .., m] then 

(5) report occurrence of p at f; z ^ i -I- tt^ ^ tto; GOTO (3) 

(6) else TTfc ^ TTfc+i; Process(Trfc); GOTO (3); 

(7) else { mismatch p, has been found} 
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(8) if TTfc is non-periodic then 

(9) i^z+^;7rfc^^o;GOTO(3) 

(10) else { prefix has a short period q } 

(11) if mismatch breaks periodicity q then 

(12) i ^ i -I- |7Tfc| - g; TTfc ^ 7To; GOTO (3) 

(13) else i^i + q GOTO (3) 

Procedure Process(7Tfc) computes the longest periodic prefix of 1:^1 its period 
q, and a pair of symbols that breaks this period (if any) under inductive assump- 
tion that the same information is available for prefix iTk-i (condition required 
to enter a new stage). There are two cases. If prefix iTk-i is non-periodic we 
search for all occurrences of TTk-i in tt^, using e.g. sequential sampling, P|. A 
constant number of occurrences of iTk-i (tt^-i is non-periodic) determine a con- 
stant number of candidates for the longest periodic prefix of . We check all the 
candidates naively in time 0(7Tfe). If prefix iTk-i is periodic with a period r we 
extend this period as far as we can in i.e. until the breaking pair of symbol 
is found or the end of is reached. Since operation is done in time 0(7Tfc), the 
running time of procedure Process(7Tfc) is also bounded by 0(7Tfc). 

Theorem 2. There is a eonstant spaee SLP-eompressed pattern matching that 
works in time 0{t). 

Proof. We prove here that algorithm SM works in time 0{t). We use amortisation 
argument, i.e. we show that the time c spent (number of symbol comparisons) 
in each stage, is eventually amortized by a shift of size 17(c). And then since the 
total sum of shifts is bounded by the length of the text t the running time of the 
algorithm is bounded by 0{t). In each stage the cost of algorithm SM can be 
estimated by analysis of one of several cases. During a stage that we enter with 
prefix TTfe already recognised we use an invariant that the time spent (so far) to 
process prefix tt^, is 0{-Kk). The following holds 

1. If prefix TTfc+i is recognized successfully and iTk+i = P (line 5), we report the 
occurrence of pattern p and we do the shift of size . The work determined 
by earlier processing of and naive test of symbols in Hk+i is bounded by 
0(7Tfc+i) = 0{p). The shift is of size I7(p), thus the amortisation argument 
is fine, see Figure Et. 

2. If prefix TTfc+i is recognized successfully and iTk+i P, we go to the next 
stage with TTfc ^ r^k+i- The total cost of processing is 0{TTk+i), which is fine 
with our invariant, see Figure Through the remaining cases, we work 
under assumption that prefix iTk+i has not been recognised, i.e. a mismatch 
p, has been found. 

3. If prefix TTfc is non-periodic; the processing of and a naive check of sym- 
bols of 7Tfc_|_i is bounded by 0{-Kk). Since we do a shift of size l7(7Tfc) the 
amortisation argument is fine, see Figure it. 

4. If prefix TTfc is periodic with the period q, but mismatch p breaks this period; 
the processing of and a naive check of symbols of tt^+i is bounded by 
0{'Kk)- Since we do ashift of size \Tik\ — q = l7(7Tfc) the amortisation argument 
is fine, see Figure EJi. 
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5. If prefix tt^ is periodic and mismatch fi continues periodicity q, we do a shift 
of size q that amortises cost of lost first q symbols in TTfc and we stay with a 
long recognised prefix in the same stage. Thus the amortisation argument is 
fine in this case too, see Figure^. 
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Fig. 4. Five cases: a,b,c,d,e (starting from the top) 



4 Experimental Work 

In this section we discuss efficient implementation of KMP algorithm in SLP- 
compressed texts. We have made several experiments with SLP extended alpha- 
bets of bounded size. In particular we were interested in alphabet sizes 256 and 
1024, see Tables 1 and 2. We have used three test files: dna. human of the Biomed- 
ical Corpus, and two arbitrary files of the Large Canterhery Corpus: Bible.txt 
and kjv.gutenberg. 



Table 1. Compression efficiency 





original text 


PE-256 


PE- 1024 


Gzip 


dna.human 

Bible.txt 

kjv.gutenberg 


3170608 

4047392 

4846137 


1178871 

2042252 

2343010 


1047300 

1539550 

1781395 


968094 

1176856 

1416051 
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Our simple implementation of recursive pairing compression algorithm (with 
alphabets of size 256 and 1024) is almost as good as compression done by UNIX 
gzip command (based on Lempel-Ziv method fl5| . |21|1 for file dna. human, and 
slightly worse for files Bible.txt and kjv.gutenberg. Note that recursive pairing 
method deals usually with much larger alphabets that lead to more efficient 
compression. 

We have also implemented a version of algorithm KMP that works for SLP- 
compressed texts. Algorithms PE-256 and PE- 1024 solve compressed string 
matching problem for alphabets of size 256 and 1024 respectively. Table 2 wit- 
nesses their performance in comparison with algorithm KMP (running time of 
PE-*/running time of KMP) that works with uncompressed files. We used the 
same source files as in Table 1. You can see here a difference between searching 
files via remote access (i.e. access to files stored in our global UNIX system) and 
files stored on local disks of our Linux workstations. In case of a remote access, 
algorithm KMP spends a lot of time on accessing large (uncompressed) UNIX 
files from a global disk. In this case a speed-up achieved by searching compressed 
(much smaller) files is quite large (4 times). In case of an access to a local disk 
the speed-up is less impressive however searching time in compressed file is still 
2 times faster then a use of a straightforward implementation of algorithm KMP 
in uncompressed files. Notice also that searching in compressed files with larger 
alphabet is slower. This is a part of a time/space trade-off, i.e. use of a larger 
alphabet leads to a better compression but more complex SLPs, that are to be 
traversed by the string matching algorithm. 

Table 2. Searching speed-up 





Remote access 


Local Hard-Drive 




PE-256 


PE-1024 


PE-256 


PE-1024 


dna. human 


21.49% 


24.29% 


50.16% 


56.66% 


Bible.txt 


14.21% 


18.93% 


44.28% 


54.27% 


kjv.gutenberg 


14.01% 


18.35% 


43.75% 


53.75% 



5 Conclusion 

We have introduced first optimal linear time constant space compressed string 
matching algorithm that works for SLP-compressed files. The choice of straight- 
line programs (SLPs) as a compression method is due to their compression effi- 
ciency and suitability for a compressed search. We performed also a number of 
experiments showing that KMP algorithm performed in compressed environment 
and constant extra memory is superior to KMP algorithm run on uncompressed 
text. Our algorithm has also very straightforward application in other settings 
of a string matching problem. For example, if we allow to use 0{p) extra mem- 
ory we can easily implement 0(tp)-time standard algorithm solving approximate 
pattern matching with edit distance. We believe that this could be very useful 
in many applications dealing with huge biological data, and where the space 
constraints are critical HZ!. 



Time/Space Efficient Compressed Pattern Matching 149 



References 

1. A. Amir, G. Benson, and M. Farach, Let sleeping files lie: Pattern matching in 
Z-compressed files, Proc. of 5tb Annual ACM-SIAM Symposium on Discrete Al- 
gorithms, January 1994. 

2. A. Amir, G.M. Landau, and D. Sokol, Inplace Run-Length 2d Compressed Search, 
In Proceedings of 11th Annual ACM-SIAM Symposium on Discrete Algorithms, 
SODA’2000, San Francisco, pp. 817-818. 

3. D. Breslauer. Saving comparisons in the Crochemore-Perrin string matching algo- 
rithm. Theoretical Computer Science, 158(1-2):177-192, May 1996. 

4. M. Crochemore and D. Perrin. Two-way string-matching. Journal of the ACM, 
38(3):651-675, July 1991. 

5. M. Crochemore, W. Rytter, Text algorithms, Oxford University Press, 1994. 

6. M. Farach and M. Thorup, String Matching in Lempel-Ziv Compressed Strings, 
Proc. 27th ACM Symposium on Theory of Computing, pp. 703-713, 1994. 

7. P. Ferragina and G. Manzini, Opportunistic Data Structures with Applications. 
Proc. 41st IEEE Symposium on Foundations of Computer Science, (FOCS’OO). 
Redondo Beach (CA), 2000, pp. 390-398. 

8. Z. Galil and J. Seiferas. Time-space-optimal string matching. Journal of Computer 
and System Sciences, 26 (3): 280-294, June 1983. 

9. L. Gasieniec, W. Plandowski, and W. Rytter. The zooming method: a recursive 
approach to time-space efficient string-matching. Theoretical Computer Science, 
147(l-2):19-30, August 1995 

10. L. Gasieniec, W. Plandowski, and W. Rytter. Gonstant-space string matching with 
smaller number of comparisons: Sequential sampling. In Proc. of 6th Combinatorial 
Pattern Matching, LNGS 937 , pages 78-89, Espoo, Finland, July 5-7, 1995., 

11. L. Gasieniec and W. Rytter. Almost optimal fully compressed pattern matching. In 
Proceedings of Data Compression Conference (DGG’99), Snowbird, March 1999. 

12. J.G. Kieffer, A Survey of Advances in Hierarchical Data Gompression, Technical 
Report, Department of Electrical & Gomputer Engineering, University of Min- 
nesota, 2000. 

13. D. Knuth, J. Morris, and V. Pratt, Fast pattern matching in strings, SIAM J. on 
Computing, 6 (1977), pp. 323-360. 

14. N.J. Larsson, Structures of String Matching and Data Compression. Ph.D. Disser- 
tation, Dept, of Computer Science, Lund University, Sweden, 1999. 

15. A. Lempel and J. Ziv On the complexity of finite sequences, IEEE Transactions 
on Information Theory, pp. 22:75-81, 1976. 

16. M. Miyazaki, A. Shinohara, and M. Takeda, An Improved Pattern Matching for 
Strings in Terms of Straight-Line Programs, Journal of Discrete Algorithms, Vol. 
1(1), pp. 187-204, 2000. 

17. L. Mouchard, Presentation at London Algorithms Workshop, LAW’2000, King’s 
College London. 

18. C. Nevill-Manning and I. Witten, Identifying Hierarchical Structure in Sequences: 
A Linear-Time Algorithm, Journal of Artificial Intelligence, Vol. 7, pp. 67-82, 1997. 

19. H. Schorr and W.M. Waite, An Efficient Machine-Independent Procedure for 
Garbage Collection in Various List Structure, In CACM 8(10), August 1967. 

20. Y. Shibata, T. Kida, S. Fukamachi, M. Takeda, A. Shinohara, T. Shinohara, Speed- 
ing up pattern matching by text compression. In Proceedings of 4th Italian Con- 
ference on Algorithms and Complexity, CIAC 2000, March 1-3, 2000 Rome, Italy. 

21. J. Ziv and A. Lempel, A universal algorithm for sequential data compression, IEEE 
Transactions on Information Theory, pp. IT-23(3):337-343, 1977. 




Modelling Change with the Aid of Knowledge 

and Time 



Bernhard Heinemann 



Fachbereich Informatik, FernUniversitat Hagen, 
PO Box 940, D-58084 Hagen, Germany 
bernhard.heinemann@fernuni-hagen.de 



Abstract. This paper is about a formalism describing the change of a 
given set in the course of time. Starting at the Halpern-Moses semantics 
of evolving knowledge in distributed systems and restricting attention to 
synchronous ones, then the knowledge state of an agent having a part 
in the system represents a paradigm for such a changing set, and also 
a guide to our modelling attempt. We develop an appropiate language 
of change and axiomatize the set of theorems of a corresponding logic. 
Afterwards, we are concerned with the basic properties of the resulting 
system: semantic completeness, decidability, and complexity. It turns out 
that simplicity of the facts is reflected as simplicity of the system, in a 
sense. 



1 Introduction 

Temporal logic has turned out to be one of the long-lasting and most promi- 
nent logics in computer science. The main reason for this is that temporal logic 
provides a basic and flexible formalism for modelling and reasoning about con- 
currency. Both linear time and branching time systems have actually proved 
their usefulness concerning these tasks in many respects. Let the textbooks of 
Manna and Pnueli, m and witness this in place of the waste literature on 
that subject. The flexibility of linear time temporal logic, for instance, can be 
justified in its ‘readiness for synergy’. In fact, this logic acts effectively in combi- 
nation with different approaches to modelling non-sequential scenarios. Let us 
mention two corresponding examples, which are rather remote from each other: 
the integration of partial orders, cf PP , and the point of view of reasoning about 
knowledge, cf mg. 

Especially the second field deserves attention presently since it is closely 
related to the topic of this paper. Halpern and Moses have reasonably shown 
how the notion of knowledge may help to attack certain problems arising in 
connection with distributed systems, in their award-winning article cited above 
(Godel Prize 1997). For our purposes it is sufficient to have a quick glance at 
this notion, which is done in the following. 

A certain kind of knowledge can be ascribed to each processor in a distributed 
system, or, more generally, to every agent A involved in a multi-agent system. 
This knowledge coincides, by definition, with the set of formulas being valid in 
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every state of the system the agent considers possible. The relation of possiblity 
between states, Ra, occurring here throws the bridge to knowledge as a term 
of philosophy, investigated by Hintikka in m- In connection with distributed 
systems, however, possibility specializes in an equivalence relation meaning in- 
distinguishability of states to the agent. Thus every equivalence class represents 
some knowledge state of A about the system. There is a corresponding logic of 
knowledge, which may be identified with the multi-modal system S5m in case of 
m agents; see 0. Now, combining S5m and linear time temporal logic yields in 
particular a language which is expressive enough to treat interesting aspects of 
the development of knowledge formally. For example, the notions of synchronous 
systems, perfect recall, and no learning, have been studied to a certain extent 
in the literature; see m3 and 0, e.g. Subsequently we confine ourselves to such 
‘synchronous systems’, where the agents have access to a common clock. Then, 
roughly speaking, ‘perfect recall’ means a successive shrinking of the agent’s 
knowledge state, while ‘no learning’ means increasing of this set in the course of 
time. 

One would expect that, concerning the agent A, phases of learning, i.e., de- 
crease of the knowledge state (corresponding with ‘perfect recall’), and phases of 
forgetting, i.e., growth of the knowledge state (corresponding with ‘no learning’), 
generally alternate. It is exactly this alternating behaviour of a given set which 
we are going to model in the present paper. With this aim in view we have to 
alter linear time temporal logic appropriately. Moreover, we have to take into 
account the way of how the temporal and knowledge connectives of the com- 
bined languages interact with respect to the just indicated semantics. All that 
will be mirrored in the new language and the formal system to be defined in the 
technical part. 

There is a further aspect worth to be mentioned here. Disregarding the knowl- 
edge context we obtain a logical framework modelling qualitatively the temporal 
change of sets. This might be of interest to people working on (spatio-)temporal 
databases or corresponding reasoning formalisms, for instance. In fact, enlarge- 
ments of the present system can provide the theoretical basis for such tasks. For 
now we content ourselves with these hints. We believe that enough reasons have 
been given showing that the topic of the paper may be of interest to a wider 
audience from different computer science communities. 

Logics of changing sets and knowledge states, respectively, emerge from 
modal logics of topology. The paper 0] is the fundamental one for that. As 
to the class of trees studied from a topological and a modal point of view at 
the same time we refer to the paper jSj. Relevant to the current situation we 
move to a temporal setting. The papers IE! and PI can be viewed as predeces- 
sors of the present one, in a sense. Dealing satisfactorily with linearity, i.e., the 
linear structure of sets with respect to time, makes up the improvement com- 
pared with and considering full linear time temporal logic instead of only 
its nexttjm e-fragment goes beyond P. 

We divide the paper into three parts. First, we treat shrinking sets. We 
introduce the syntax and semantics of a corresponding language, axiomatize the 
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set of ‘theorems of shrinking’, sketch briefly how semantic completeness of this 
axiomatization can be proved, and discuss effectiveness properties. It turns out 
that the issues of this part already give hints to what can be expected in case of 
increasing sets, and for the whole system as well. So, our exposition is somewhat 
more detailed here. Afterwards, in the second part, we proceed in a similar way 
with respect to growing. Finally, we glue together both parts suitably. 

We assume acquaintance of the reader with basic modal and temporal logic. 
All that we need is contained in the standard textbooks; cf PI, Q, and 0, for 
example. 

The paper is intended to provide some fundamentals concerning certain 
‘knowledge-sensitive’ computations. So, neither additional references to applica- 
tions are given deliberately nor even case studies, apart from this introduction. 
Moreover, numerous details are omitted due to the lack of space; this especially 
goes for proofs. 



2 Temporal Logic of Decreasing Sets 

As has been set out in the introduction we want to formalize, in a sense, the 
change of the knowledge state of some agent in the course of time. In the present 
section we assume that this set shrinks gradually. While shrinking will be ex- 
pressed by the usual connectives nexttime, henceforth and until of propositional 
linear time temporal logic (Oi Dj hi), the knowledge operator K associated with 
the agent quantifies over the set at any time. 

In order to define the set WFFd of well-formed formulas of the logical lan- 
guage we let PV = {p,q,r, . . .} be an enumerable set of propositional variables; 
the index ‘d’ indicates ‘decreasing’. Designating formulas by lower case Greek 
letters the set WFF^ is determined by the following recursive conditions then: 

a := p \ -la \ a t\ (3 \ lEdO | DdO | aUdfd \ Ka. 

Concerning duals of the one-place operators, we let 

Qdcr ~ ~'Cx., 'OdO! := — 'Dd“'Q!, La := —<K—<a. 

Moreover, the boolean connectives V and — >■ are treated as abbreviations. Note 
that we regard as basic the ‘universal’ version of nexttime, because we will have 
only partial functionality of the associated accessibility relation. 

Next we define the semantics of our language for the ‘shrinking’-case. As 
we would like to treat the decrease of a given set X, certain subsets of X have 
to be included in the formal model. Consequently, we take X and the system 
of those subsets, O, as basic ingredients of the domains where formulas are 
evaluated. However, the set O carries a time structure which is made explicit by 
the following definition. — Subsequently, let V{S) denote the powerset of S, for 
an arbitrary set S. 
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Definition 1. 1. Let X ^ % he a set and O a set of subsets of X. Furthermore, 

suppose that (/, <) is an initial segment of (N, <), and there exists an order- 
reversing surjective mapping from I onto O: 

d:{I,<)^{0,C). 

Then S := {X,I -^O) is called a sequence of decreasing sets. 

2. Let S = {X,I -^O) be a sequence of decreasing sets and V : PV — > 'P(X) 

a mapping. Then V is called a valuation, and M. := {X, L V) is called 

a temporal model of decreasing sets, or briefly a model (based on S). 

Notice that ‘the future includes the present’, as it is common in computer 
science. 

We are going to evaluate formulas in models at situations of the underlying 
sequence of decreasing sets, {X,L -^O). Such situations are simply pairs x, Ui 
satisfying x £ Ui = d{i) £ O, where i £ L; these pairs are designated without 
brackets. 

Definition 2. Let be given a model M. = {X,I ~^0,V) and a situation x,Ui 
of the sequence of decreasing sets which A4 is based on. Then we define 



X, Ui 


\=M 


P 


iff 


X £V{p) 






X, Ui 


\=M 


-<a 


iff 


X, Ui 


V^M 


a 




X, Ui 


\=M 


a A f3 


iff 


X, Ui 


\=M 


a and x, Ui \=m P 




X, Ui 


\=M 


Ka 


iff 
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U, : 


U-)Ui |=A1 ^ 




X, Ui 


\=M 


KldO; 


iff 


j + 1 


G I 


and X € imply x, 


Uj-\-l \=M « 


X, Ui 


\=M 


□ dO 


iff 


< 

IV 


i : {j £ I and x G Uj imply 


3^5 Uj Qi) 


X, Ui 


\=M 


aUdl3 


iff 


LU 

IV 




j £ I and x G Uj and 
and M j > k > i : x,Uk 





for all p £ PV and a,j3 £ WFFd. 

In case x, Ui \=m oi is satisfied we say that a holds in M. at the situation 
x, Ui; moreover, the formula a £ WFFd is said to be valid in M, iff it holds in 
M at every situation. 

Now we present a list of schemata aimed at providing a sound and complete 
axiomatization of all the formulas which are valid in every temporal model of 
decreasing sets. The schemata are divided into three groups, called axioms of 



sets, time, and decrease. 


respectively. 


— Axioms of sets: 


(SI) 


All instances of propositional tautologies 


(S2) 


K{a -A {Ka -A Kj3) 


(S3) 


Ka -A a 


(54) 


Ka -A KKa 


(55) 


La — y K La 
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— Axioms of time: 

(T 0) {p KIdp) A {-•p KId-'p) 
(Tl) Kld(a — >■ /?) — >■ (KldO! — >■ Kld/3) 

(T 2) Od« ^dO! 

(T 3) nd(o! — >■ /?) — >■ (ClldCi! — >■ Dd/3) 
(T 4) DdO! — >■ a A lEldDdo; 

(T 5) □d(o! — >■ KldO;) —>■(«—>■ □dO?) 

(T6) aUdP^OdP 

(T7) aWd/3^/3V (aAOd(aWd/3)) 

— Axioms of decrease: 

(HI) if Kid a — )> Kld-f^a 
(H2) Kdifa if Kd a V Kd /3 



The schemata of the ^-group make the operator if an S5-modality. This 
means to the accessibility relation i? of a usual Kripke frame (IT, R) validating 
(^S), (S' 4) and (S5), to be an equivalence. In addition, (SI) - (S 5) determine 
the logic of knowledge of a single ‘ideal’ agent; see |^. Presently these axioms 
enable us to realize sets as equivalence classes. 

Axiom (TO) of the second group is due to the fact that the semantics of 
propositional variables is defined independently of the set component of a situa- 
tion; in particular, all p S PV are true and false respectively, regardless of time. 
Moreover, the modality Kd represents a partial function because of the schema 
(T2); i.e., this axiom is valid in a Kripke frame {W,R), iff the accessibility re- 
lation R is partially functional. In our language of sets these axioms provide for 
the linear structure of sequences of decreasing sets on the level of points. The 
schemata (T 4) and (T 5) are intended to express that the accessibility relation 
corresponding with Dd is the reflexive and transitive closure of the accessibility 
relation corresponding with Kd. Finally, (T6) and (T7) describe the relevant 
properties of the until-operaior; its ‘fixed-point character’, in particular, is cap- 
tured by the last axiom of this group. 

The third group of axioms is the decisive one. The schemata of this group 
formalize the interaction of the unary operators (the ‘modal’ connectives) in- 
volved in the system. {D 1) is the real ‘axiom of shrinking’ and is typical of the 
various systems of topological reasoning; see 0|. Finally, (H2) is responsible for 
the linear structure of sequences of decreasing sets on the level of sets. 

Adding the following usual rules yields a logical system designated D; this 
letter should remind one of ‘decrease’, too: 



( 1 ) 



a — >■ /3, a 

P 



( 2 ) 



a 

KIda 



( 3 ) 



a 

□d« 



( 4 ) 



a 

Ka 



Our first result states the soundness of the just defined system. The proof is 
easy by reverting to the above definitions. 



Proposition 1. Let a G WFFd be an T)-derivable formula. Then a is valid in 
every temporal model of decreasing sets. 
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Fig. 1. Diagram properties 



Semantic completeness is much more complicated to prove. One starts off 
with a canonical model construction. Let 




denote the accessibility relations belonging to the one-place connectives Kl, □ 
and K, respectively, of the canonical model. (Notice that we have omitted the 
index ‘d’, and we do so for the rest of this section.) The above axioms provide for 
some good behaviour of these relations. In fact, we get the following important 
diagram properties: 



(1) For all points s,t,u of the canonical model such that s t — ^ u there 
exists a point v such that s -^v -^u. 



This is valid because of {D 1). 



(2) For all s, t, u, v such that s — ^ t u and s v there exists a point x 
such that s -^x — 



For this, (D2) is responsible; see Figure 1. 

It is crucial for our purposes that properties (1) and (2) are transmitted to a 
special filtration of the canonical model: the one having filtration set Moss and 
Parikh’s filtration set F (see Pj, p. 92) built on Goldblatt’s filtration set (see |0|, 

p. 92), and the minimal filtrations and i— ^ of and — ^ , respectively. 

(We need not consider at the moment.) In fact, marking elements of the 
filtration with a bar we get 

is an equivalence relation, 
u. Then there exists a point v such that 



Proposition 2. 1. The relation 

2. Suppose that we have s t 




156 



B. Heinemann 



3. Assume that s i-^v and s i-^t i-^u is valid for elements s, t, u, v of the 
filtration. Furthermore, let KI/3 G s for some formula KI/3 G F. Then there 
exists a point x such that s i-^x i— 

Beyond usual linear time temporal logic one has to be able to select a suitable 
successor of the whole -equivalence class of a point s under consideration, at 
least as far as the realization of Kl-formulas contained in F is concerned. That 
this faithfully is possible can be guaranteed by (3) of Proposition Q, 

Now, after some preprocessing peculiar to the new system the unwinding 
procedure known from linear time temporal logic (0, P- 96) is applicable, yield- 
ing a certain intermediate Kripke model A4' . With the aid of this model we can 
construct the desired temporal model A4 of decreasing sets. The domain of A4 
is a suitable space of partial functions having domain contained in N and range 
contained in A4'] furthermore, a function / belongs to the t-th subset iff f{i) 
exists, for all i G N. We can then prove an appropriate truth lemma, using the 
diagram properties essentially. This implies the first of our main results. 

Theorem 1. Let the formula a £ WFF be valid in every temporal model of 
decreasing sets. Then a is T) -derivable. 

The completeness proof roughly sketched above does not obviously yield the 
finite model property of our logical system. We have to proceed differently in 
order to achieve this result. First we introduce some prerequisite notions. To this 
end, let I := {I, <) be an initial segment of (N, <). 

1. A subset 0 J' C / is called a segment of I, iff there is no * £ / \ I' strictly 
between any two elements of I' . 

2. A partition of / into segments is called a segmentation of I. 

Let be given a formula a and a model M = {X, I — ^ O, V). We will have 
to consider segmentations of I = (/, <) such that the truth value of a remains 
unaltered on every segment, in the following sense. 

Definition 3. Let a £ WFF be a formula and M = {X,L ~^0,V) a model. 
Furthermore, let A be an indexing set and V := {I\ | A £ A} a segmentation of 
X = (/, <). Then a is called stable on V , iff for all X G A and x G X we have 

\/ i G X\ : {x G Ui => x,Ui \= a), or \/ i G X\ : (x G Ui => x,Ui \= ~<a) . 

Actually, one can always get a finite segmentation of I on which a is stable. 

Proposition 3. Let A4 = (X,L — ^0,P), X and a be as above. Then there 
exists a finite segmentation Va '.= {X\, . . . ,Xn} of X such that a is stable on Va. 
Moreover, Va can be chosen such that it refines Vp for every subformula j3 of a, 
and the number of segments is polynomial in the length of a. 
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Va can be constructed inductively, starting with the trivial segmentation 
{{I}} in case of a propositional variable (due to axiom (TO)). 

According to the first statement in Proposition 0 the question whether a 
given formula a is satisfiable can be reduced to models of ‘finite depth’. By a 
standard procedure of the logic of set spaces this question can then be whittled 
down to models which are of ‘finite width’ additionally; cf 0, 3.35, e.g. This 
eventually yields the finite model property, which implies decidability. 

But we get even more, due to the second statement in Proposition 0 counting 
segments carefully along the tree structure of a and utilizing the S5-properties 
of K (among other things) we obtain that the D-satisfiability problem is NP- 
complete. In view of m, Theorem 4.1, i.e., already because of the presence of 
the 0“ ^cid the D-operator, this result is somewhat surprising at first glance. 
However, axioms (T 0) and {D 2) are responsible for breaking down the com- 
plexity. 

Theorem 2. The set of formulas a S WFF being valid in every model, is co-NP 
complete. 



3 Temporal Logic of Increasing Sets 



In this section we model the successive growth of some given non-empty set X. 
Our proceeding here is very similar to that in Section 2; so, we may be brief 
currently and point up the differences only. 

Concerning the syntax we merely alter the index ‘d’ and let it be an ‘i’, 
indicating increasing. Semantically the same structures are considered as before, 
but we look upward the chain of subsets of X now, i.e., take (O, A) instead of 
(O, C) in Definition 0 Correspondingly, frames are called sequences of increasing 
sets. As to the validity relation between situations and formulas with respect to 
models, \=m^ nothing is changed; see DefinitionH Notice that the requirements 
‘x G Tj+i’ and ‘x G Uj’ in the clauses concerning Kl,, Di and Hi, respectively, 
become redundant now and can be left out therefore. 

More interesting things happen to the axiom schemata. One expects that 
the third group is concerned. In fact, while the axioms of sets and time remain 
unaltered we have the following 

— Axiom of increase: 

(71) KliTFo! — >■ 77 Kli a 



Interestingly enough, a further axiom of increase is missing, in contrast to 
the case of shrinking. The reason for this is that (1 1) is stronger than {D 1), in 
a sense. Here is an explanation: The diagram property corresponding with the 
schema (71) reads 



(3) for all points s,t,u of the canonical model such that s — ^ t u there 
exists a point v such that s -^v — 
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Fig. 2. Phases of a changing set 



Now, it is not hard to prove that this together with partial functionality of the re- 
lation implies linearity of the time structure of the —^-equivalence classes. 
Moreover, property (3) is also passed to the filtration considered in Section 2, 
which serves its purposes again thus. 

Let the system indicated above be designated I. Taking into account the 
described modifications one gets: 

Theorem 3. The system I is sound and complete with respect to models of 
increasing sets. Furthermore, the set of 1-derivahle formulas a G WFFj is co- 
NP complete. 

The segmentation techniques sketched in the previous section apply corre- 
spondingly, yielding the effectiveness issues of Theorem 0 The complexity result 
is somewhat surprising anew, in view of 4.7. Note that we are confronted 
once again with a situation similar to that of the preceding section: axiom (T 0) 
works together with a ‘linearity axiom for sets’, which equals (1 1) this time. 
But, since (II) is also an axiom of the usual logic of knowledge in synchronous 
systems with no learning, see Theorem 3.8 in HH, (TO) is ‘dominant’ in the 
present case. (In fact, in the proof of the result 4.7 in [Ej it is used that Axiom 
(TO) does not hold for the common logic of knowledge.) 

Completeness and decidability of a system similar to the present one has 
been obtained in US]. 

4 Combining the Systems D and I 

The final system is a combination of the systems D and I in the following sense: 
Building the set of well-formed formulas, each of the preceding rules is allowed. 
Concerning the semantics, the set O of subsets of the domain is structured by 
phases of growing and shrinking respectively; see Figure 2. This is made precise 
through the next definition. 

Definition 4. 7. Let X he a non-empty set, I an initial segment of N and 

d : I — > V(X) a mapping. Furthermore, let I' be a segment of I. Then d 
is called increasing on I', iff i < j d(i) C d(j) for all i,j G I', and 

decreasing on I', iff i < j d(i) A d(j) for all i,j G 
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2. Let X, I and d be as above. Moreover, let I := be a segmentation 

of I, where J is a suitable initial segment o/N, and Ck € {d, i} for all k € J. 
Then d is called faithful on I, iff d is increasing on I'ff in case Ck = i, and 
d is decreasing on I'ff in case Ck = d, for all k € J. 

We then regard as relevant only sequences of sets S = {X, I -^O) which are 
partitioned by a segmentation I in a way such that d is faithful on I. Let us call 
such structures proper. The evaluation of formulas in models based on proper 
sequences of sets respects the phases of I, and follows Definition 0 otherwise. 
Let us explain this more detailedly. So, let be given a model 

M = {X,I ^0,V) 

based on a proper sequence of sets. Let I = {I'ff)k^j be the corresponding 
segmentation of /, i.e., the segmentation where d is faithful on. Finally, let x, Ui 
be a situation of S. Then there exists a unique k G J such that i G . We write 
^’k(i) latter set. Quantification is defined with respect to this phase then. 

For example: 



X, Ui \=M □da : ^1=^ y j >i- (^j G A Cfe(q =dAx GUj x, Uj \=m 

In case of KIc it has to be required that i + 1 is contained in some ‘c-phase’, in 
the premise of the implication on the right-hand side (c G {d,i}). 

Apart from the schemata of Section 2 and Section 3 there is a single axiom 
of change expressing the transition from a shrinking phase to a growing one, and 
vice versa. 



— Axiom of change: 

(Cl) KMij3 

The corresponding system, C, is thus determined by the schemata (S'!) - 
(S' 5), (n) - (T7), (Dl), (D2), (71) and (Cl), and the rules given in Section 

2 . 

Let us discuss the axiom of change. Its effect on the canonical model is 
described in the following proposition. 



Proposition 4. Let [s] be an — > -equivalence class of the canonical model. 
Then there do not exist u,t G [s] and points t' ,u' such that t ^^t' and u . 



Consequently, the desired behaviour can be guaranteed on the canonical 
model: every equivalence class as a whole can exclusively be extended with re- 
spect to growing or shrinking, if it can be extended at all. 

Fortunately the behaviour of the equivalence classes is transmitted to the 
filtration, at least in the meaning of Proposition El3), i.e., as far as KIc-formulas 
contained in the filtration set are concerned (c G {d, i}). The reason for this is 

that we have taken the minimal filtration of and respectively. This 
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suffices for our purposes. For, the unwinding procedure mentioned in Section 2 
leads to a proper sequence of sets then. In this way we get semantic completeness 
of the system C. 

The previous decidability and complexity assertions remain valid in case of 
the combined system as well. In fact, segmentation of a given model is carried 
out with respect to the separate phases here so that only the last one is crucial to 
the finite model property. This property follows as above, and hence decidability, 
too. 

As to complexity, it must only be guaranteed additionally that the number 
of phases is polynomial in the length of the formula under consideration. But 
this is clear since that number coincides with the degree of alternately nesting 
d- and i-operators in the formula. 

We can state the main result of this paper now. 

Theorem 4. 1. The system C is sound and complete with respect to proper 

sequences of sets. 

2. The set of C -theorems is co-NP complete. 



5 Concluding Remarks 



We have presented a formalism originating from the logic of knowledge of an 
agent involved in a synchronous system. The underlying language can describe 
the change of the knowledge state of the agent ‘from his own point of view’. 
Moreover, the language can be viewed as a core language expressing in general 
the change of a given set in the course of time. 

Our results include completeness of the given axiomatization, decidability of 
the logic, and the determination of the complexity of the set of theorems. It 
turned out that the latter set is ‘only’ complete in co-NP. This result can be 
interpreted as simplicity of the logic. In fact, due to our special axioms neither 
the temporal component of the system contributes to the complexity nor the 
interplay between knowledge and time, so that the knowledge component has a 
determining influence on this alone (cf [Bj, Section 3.5). 

In view of the second possible held of application of our system remarked 
above it is desirable to extend the formalism in order to get to grips with points 
inside the given set explicitly (among other things). Maybe hybrid logic, see |2|, 
is a good candidate for this. Hybrid subset space logics will be a subject of future 
research. 

Another topic worth to be investigated is the purely modal treatment of 
changing sets. Interestingly enough, a complete axiomatization of structures of 
the type (AT, O), where X is a non-empty set and O forms a chain of subsets of 
X with respect to inclusion, is still missing in the framework of the modal logic 
of topology. 
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Abstract. Rabi, Rivest, and Sherman alter the standard notion of 
noninvertibility to a new notion they call strong noninvertibility, and 
show — via explicit cryptographic protocols for secret-key agreement 
(lESSlESnZI attribute this to Rivest and Sherman) and digital sig- 
natures IHSDmSi)7l — that strongly noninvertible functions would be 
very useful components in protocol design. Their definition of strong 
noninvertibility has a small twist (“respecting the argument given”) 
that is needed to ensure cryptographic usefulness. In this paper, we 
show that this small twist has a large, unexpected consequence: Unless 
P = NP, some strongly noninvertible functions are invertible. 

Keywords: Computational and Structural Complexity. 



1 Introduction 

Rabi, Rivest, and Sherman developed novel cryptographic protocols that require 
one-way functions with algebraic properties such as associativity (see jHSDdI 
and the attributions and references therein, esp. |She86IKR,S88] L Moti- 
vated by these protocols, they initiated the study of two-argument (2-ary, for 
short) one-way functions in worst-case cryptography. To preclude certain types 
of attacks, their protocols require one-way functions that are not invertible in 
polynomial time even when the adversary is given not just the function’s output 
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but also one of the function’s inputs. Calling this property of one-way func- 
tions “strong noninvertibility” (or “strongness,” for short), they left as an open 
problem whether there is any evidence — e.g., any plausible complexity-theoretic 
hypothesis — ensuring the existence of one-way functions with all the properties 
the protocols require, namely ensuring the existence of total, commutative, as- 
sociative one-way functions that are strongly noninvertible. This problem was 
recently solved by Hemaspaandra and Rothe |Hb91I| who show that if P 7^ NP 
then such one-way functions do exist. 

Unfortunately, Hemaspaandra and Rothe !i™| write: “Rabi and Sher- 
man also introduce the notion of strong one-way functions — 2-ary one- 

way functions that are hard to invert even if one of their arguments is given. 
Strongness implies one-way-ness.” The latter sentence could be very generously 
read as meaning “strong, one-way functions” when it speaks of “strongness,” 
especially since strongness alone, by definition, does not even require honesty, 
and without honesty the sentence quoted above would be provably, trivially, 
false. However, a more natural reading is that [H R99| is assuming that strongly 
noninvertible functions are always noninvertible. The main result of the present 
paper is that if P 7^ NP then this is untrue. So, even when one has proven a 
function to be strongly noninvertible, one must not merely claim that nonin- 
vertibility automatically holds (as it may not), but rather one must prove the 
noninvertibilityQ 

In the present paper, we study appropriately honest, polynomial-time com- 
putable 2-ary functions. We prove that if P 7^ NP then there exist strongly 
noninvertible such functions that are invertible (see Section |3 for precise defini- 
tions). This is a rather surprising result that at first might seem paradoxical. To 
paint a full picture of what happens if P 7^ NP, we also show the (nonsurpris- 
ing) result that if P 7^ NP then there exist appropriately honest, polynomial-time 
computable 2-ary functions that are noninvertible, yet not strongly noninvert- 
ible. 

So, why is the surprising, paradoxical-seeming result (that if P 7^ NP then 
some strongly noninvertible functions are invertible) even possible? Let us infor- 
mally explain. Let cr be a 2-ary function. We say a is noninvertible if there is 
no polynomial-time inverter that, given an image element z of a, outputs some 
preimage of z. We say a is strongly noninvertible if even when, in addition to any 
image element z of cr, one argument of cr is given such that there exists another 
string with which this argument is mapped to z, computing one such other argu- 
ment is not a polynomial-time task. So, why does strongness alone not outright 
imply noninvertibility? One might be tempted to think that from some given 

^ Since in [TTRjljl) only strong noninvertibility is explicitly proven, one might worry that 
the functions constructed in its proofs may be invertible. Fortunately, the construc- 
tions in the proofs in | HR9!l | do easily support and implicitly give noninvertibility 
as well; thus, all the claims of [HR.99J remain correct. Most crucially, on page 654 
of I HR, 99] . inverting the output (x, x) in polynomial time would give strings contain- 
ing one witness for membership of x in the given set in NP — P (if there are any such 
witnesses), which is impossible. 
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polynomial-time inverter g witnessing the invertibility of a one could construct 
polynomial-time inverters gi and 52 such that gi inverts a in polynomial time 
even when the *th argument is given (see Definition 0 for the formal details) . 
This approach does not work. In particular, it is not clear how to define gi when 
given an output z of a and a first argument a that together with a corresponding 
second argument is mapped to z, yet a is not the first component of g{z). (In 
fact, our main theorem implies that no approach can in general accomplish the 
desired transformation from g to gi, unless P = NP.) 

But then, why don’t we use a different notion of strongness that automati- 
cally implies noninvertibility? The answer is that the definitional subtlety that 
opens the door to the unexpected behavior is absolutely essential to the cryp- 
tographic protocols for which Rabi, Rivest, and Sherman created the notion 
in the first place. For example, suppose one were tempted to redefine “strongly 
noninvertible” with the following quite different notion: cr is “strongly noninvert- 
ible” if, given any image element z of cr and any one argument of a such that 
there exists another string with which this argument is mapped to z, computing 
any preimage of z (as opposed to “any other argument respecting the argument 
given”) is not a polynomial-time task. The problem with this redefinition is that 
it completely loses the core of why strongness precludes direct attacks against 
the protocols of Rabi, Rivest, and Sherman (it is difficult to explain why without 
giving here in full their protocols, and this isn’t a formal claim as their argu- 
ments themselves are not formal proofs of security, so suffice it to say that their 
intuitive argument is crucially drawing on the fact that the definition of strong 
noninvertibility includes the “respecting the argument given” feature, and this 
dependence will be immediately clear to anyone who reads their protocols). We 
will call the just-defined notion “overstrongness,” as it seems to be overrestrictive 
in terms of motivation — and we will prove that if P NP then overstrongness 
indeed is a properly more restrictive notion than strongness. 



2 Definitions 



Fix the binary alphabet E = {0,1}. Let e denote the empty string. Let 
(•,•) : E* X S* ^ E* be some standard pairing function, that is, some total, 
polynomial-time computable bijection that has polynomial-time computable in- 
verses and is nondecreasing in each argument when the other argument is fixed. 
Let FP denote the set of all polynomial-time computable total functions. The 
standard definition of one-way-ness used here is essentially due to Grollmann and 
Selman except that they require one-way functions to be one-to-one (inSHg, see 
also |Ko8,5IBe r77j and the surveys |Sel92IBHHR99] '): as in the papers pB97l 
IHR99IHom00j . their notion is tailored below to the case of 2-ary functions. 
Any general notions not explicitly defined can be found in standard complexity 
texts IH(;9:HPa,p94ym)G95l . 



Definition 1. 






Let p : E* X E* 



E* be any (possibly 



nontotal, possibly many-to-one) 2-ary funetion. 
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1. We say p is honest if and only if there exists a polynomial q such that: 

(Vz G image(p)) (3(a, 6) G domain(p)) [|a| + |6| < q{\z\) A p{a,b) = z], 

2. We say p is (polynomial-time) noninvertible if and only if the following does 
not hold: 

{3g G FP) (Vz G image(p)) [p{g{z)) = z], 

3. We say p is one-way if and only if it is honest, polynomial-time computable, 
and noninvertible. 



We now define strong noninvertibility (or strongness), which is a stand-alone 
property (i.e., with one-way-ness not necessarily required) of 2-ary functions. 
If one wants to discuss strongness in a nontrivial way, one needs some type 
of honesty that is suitable for strongness. To this end, we introduce below, in 
addition to honesty as defined above, the notion of s-honesty0 



Definition 2. (see, essentially, fR,S97IHP,99j l Let a : E* x E* ^ S* be any 
(possibly nontotal, possibly many-to-one) 2-ary function. 



1. We say a is s-honest if and only if there exists a polynomial q such that 
both (a) and (b) hold: 

(a) (Vz,a : (36) [cr(a,6) = z]) (36') [|6'| < q{\z\ -b |a|) A a{a,b') = z]. 

(b) (Vz,6 : (3a) [cr(a, 6) = z]) (3a') [|a'| < q{\z\ -b |6|) A cr(a',6) = z]. 

2. We say a is (polynomial-time) invertible with respect to the first argument 
if and only if 

(3gi G FP) (Vz G image(cr)) (Va, 6 : (a, 6) G domain(cr) A cr(a, 6) = z) 
[cr(a, 3 i((a,z))) = z]. 



3. We say a is (polynomial-time) invertible with respect to the second argu- 
ment if and only if 

(3g2 G FP) (Vz G image(cr)) (Va, 6 : (a, 6) G domain(cr) A cr(a, 6) = z) 
H92{{b,z)),b) = z]. 



4. We say a is strongly noninvertible if and only if a is neither invertible 
with respect to the first argument nor invertible with respect to the second 
argument. 

5. We say a is strongly one-way if and only if it is s-honest, polynomial-time 
computable, and strongly noninvertible. 



^ The strongly noninvertible functions in clearly are all s-honest, notwithstand- 

ing that s-honesty is not explicitly discussed in cHMi (or EsszEsna). 



l«BJBlia8B»l 
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It is easy to see that there are honest, polynomial-time computable 2-ary 
functions that are not s-honest0 and that there are s-honest, polynomial-time 
computable 2-ary functions that are not honest 0 

For completeness, we also give a formal definition of the notion of overstrong- 
ness mentioned in the last paragraph of the introduction. Note that overstrong- 
ness implies both noninvertibility and strong noninvertibility. 

Definition 3. Let a : E* x S* — >■ S* be any (possibly nontotal, possibly many- 
to-one) 2-ary funetion. We say a is overstrong if and only if for no f € FP with 
/ : {1, 2} X S* X E* — >■ E* x E* does it hold that for each i G {1,2} and for all 
strings z,a G E* : 

((36 G E*)\{a{a, 6) = z A i = 1) V (<t(6, a) = z A i = 2)]) 
cr(/(i,z,a)) = z. 



3 On Inverting Strongly Noninvertible Fnnctions 



It is well-known (see, e.g., jSelh2IHI )(lh5| l that 1-ary one-way functions exist 
if and only if P NP; as mentioned in |HRf)9IRS97j . the standard method to 
prove this result can also be used to prove the analogous result for 2-ary one-way 
functions. 



Theorem 1. (see 
functions exist. 






P NP if and only if total 2-ary one-way 



Now we show the main, and most surprising, result of this paper: If P y^ NP 
then one can invert some functions that are strongly noninvertible. 



Theorem 2. // P y^ NP then there exists a total, honest 2-ary function that is 
a strongly one-way function but not a one-way function. 



Proof. Assuming P y^ NP, by Theorem Q there exists a total 2-ary one-way 
function p. Define a function a : E* x E* ^ E* as follows: 



a{a, b) 



' y) if (3x, y,zG E*) [a = l{x, y) A b = Oz] 
Op(j/> z) if (3X, y,z G E*) [a = 0x Ab = l(y, z)] 

< Ixy if (3a:, y G E*) 

[(a = Oa: A 6 = Oy) V (a = la; A 6 = ly)] 
ab if a = e V 6 = e. 

\ 



^ For example, consider the function p : E* x E* — >■ E* defined by p(a, h) = 
^fiogiog(max(|6|,2))i jf p{a,b) = o6 if o y^ 0. TMs function is honest (as 

proven by p{e, x) = x) but is not s-honest, since for any given polynomial q there are 
strings b G E* and z = ^jth p{0,b) = z, but the smallest b' G E* 

with p{0,b') = z satisfies |6'| > g(|z| -|- |0|) = g(|'loglog(max(|fe|, 2))] + 1). 

^ For example, consider the function a : E* x E* — ^ E* that is defined by a(a,b) = 
j^riogiog(max(|o|,2))i |^| _ that is Undefined otherwise. This function is s- 

honest but not honest. 
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It is a matter of routine to check that a is polynomial-time computable, 
total, honest, and s-honest (regardless of whether or not p, which is honest, is 
s-honest). 

If one could invert a with respect to one of its arguments then one could 
invert p, contradicting that p is a one-way function. In particular, supposing cr 
is invertible with respect to the first argument via inverter g\ G FP, we can 
use Pi to define a function g G FP that inverts p. To see this, note that given 
any w € image(p) with w ^ e, pi((0,0rc)) must yield a string of the form 
b = l{y,z) with p(y,z) = w. Thus, a is not invertible with respect to the first 
argument. An analogous argument shows that a is not invertible with respect to 
the second argument. Thus, a is strongly noninvertible. However, a is invertible, 
since every string z G image(CT) has an inverse of the form (e, z); so, the FP 
function mapping any given string z to (e, z) is an inverter for cr. Hence, cr is not 
a one-way function. | 

The converse of Theorem Elimmediately holds, as do the converses of Propo- 
sition d Corollary d and Theorems d d and 0 However, although all these 
results in fact are equivalences, we will focus on only the interesting implication 
direction. 

For completeness, we mention in passing that, assuming P NP, one can 
construct functions that — unlike the function constructed in the proof of Theo- 
rem 0 — are strongly one-way and one-way. An example of such a function is the 
following modification a of the function a constructed in the proof of Theorem 0 
As in that proof, let p be a total 2-ary one-way function, and define function 
a : E* X E* ^ E* hy 



( y) if (3a;, p, z G E*) [a = l{x,y) ^b= Oz] 
o-(a, 6) = < Op(p, z) if (3a;, p, z G E*) [a = Oa; A 6 = l(p, z)j 
[ la6 otherwise. 

Note that a even is overstrong; hence, a is both noninvertible and strongly 
noninvertible. That is: 

Proposition 1. // P NP then there exists a total, honest, s-honest, 2-ary 
overstrong function. (It follows that ifV^ NP then there exists a total 2-ary 
function that is one-way and strongly one-way.) 

Corollary d below shows that if P y^ NP then there is an s-honest 2-ary one- 
way function that is not strongly one-way. First, we establish a result that is 
slightly stronger: For a function to be not strongly noninvertible, it is enough 
that it is invertible with respect to at least one of its arguments. The function cr 
to be constructed in the proof of Theorem 0below even is invertible with respect 
to each of its arguments. 

Theorem 3. // P yf NP then there exists a total, s-honest 2-ary one-way func- 
tion a such that a is invertible with respect to its first argument and a is invertible 
with respect to its second argument. 
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Proof. It is well-known (ISiMl Prop. 1], in light of the many-to-one analog of 
his comment p. 209] about totality) that under the assumption P ^ NP 

there exists a total 1-ary one-way function p : E* — >■ E* . Define a function 
a ■. E* X E* ^ E* as follows: 



a(a, b) 



lp(a) if a = 6 
Oab if a b. 



Note that cr is polynomial-time computable, total, s-honest, and honest. If cr were 
invertible in polynomial time then p would be too; so, cr is a one-way function. 
However, a is invertible with respect to each of its arguments. For an inverter 
with respect to the first argument, consider the function g\ : E* — > E* defined 

by 

r 6 if (3a, &, z G H*) [a; = (a, Oz) f\ z = ab] 
gi{x) = < a if (3a, z G if*) [x = (a, lz)j 
I e otherwise. 

Clearly, gi G FP. Note that for every y G image(cr) and for every a G E* for 
which there exists some b G E* with a{a, b) = y, it holds that cr(a, gi((a, y))) = y, 
completing the proof that a is invertible with respect to the first argument. To 
see that a also is invertible with respect to the second argument, an analogous 
construction (with the roles of the first and the second argument interchanged) 
works to give an inverter 52 for a fixed second argument. | 



Corollary 1. //P NP then there exists a total, s-honest 2-ary one-way func- 
tion that is not strongly one-way. 



One might wonder whether functions that are not strongly noninvertible 
(which means they are invertible with respect to at least one of their arguments) 
outright must be invertible with respect to both of their arguments. The following 
result states that this is not the case, unless P = NP. 



Theorem 4. // P NP then there exists a total, s-honest 2-ary one-way func- 
tion that is invertible with respect to one of its arguments ( thus, it is not strongly 
one-way), yet that is not invertible with respect to its other argument. 

Proof. Assuming P y^ NP, by Theorem Q there exists a total 2-ary one-way 
function, call it p. Since our pairing function is onto and one-to-one, and its 
inverses are efficiently computable, the functions — tti and 7T2 — mapping from 
each string in E* to that string’s first and second components when interpreted 
as a pair are well-defined, total, polynomial-time functions; for all b G E* , b = 
(7Ti(&), 7T2(&)). Define a function a : E* x E* ^ E* as follows: 

a{a,b) = p{TTi{b),TT 2 {b)) 

It is clear that a is honest (via p’s honesty) and s-honest. Let ao be any fixed 
string, and define g 2 {w) = ao for all strings w. Clearly, g 2 G FP. The definition 
of (7 implies that for each z = p(x, y) G image(cr) and for each b G E* such that 
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cr(a, b) = z for some a € E*, it also holds that (r(ao, b) = z. Thus, a is invertible 
with respect to the second argument via 52 • However, if a were also invertible 
with respect to the first argument via some function G FP, then gi could be 
used to invert p, which would contradict the noninvertibility of p. Hence, a is 
invertible with respect to its first, yet not with respect to its second argument. 
Analogously, we can define a function that is invertible with respect to its second 
argument, yet not with respect to its first argument. I 

Finally, let us turn to the notion of overstrongness (see Definition ED men- 
tioned in the last paragraph of the introduction. As noted there, this notion is 
not less restrictive than either noninvertibility or strong noninvertibility, and so 
if a given polynomial-time computable, honest, s-honest function is overstrong 
then it certainly is both one-way and strongly one-way. Even though overstrong- 
ness is not well-motivated by the cryptographic protocols of Rabi, Rivest, and 
Sherman Essa, for the purpose of showing that the notions do not collapse, we 
will prove that the converse does not hold, unless P = NP. 

Theorem 5. //P yf NP then there exists a total, honest, s-honest 2-ary function 
that is noninvertible and strongly noninvertible hut that is not overstrong. 

Proof. Assume P NP. It is known (see that this assumption implies 

that total 1-ary one-way functions exist. Let p be one such function, and let 
p be such that it additionally satisfies (3r > 2) (Vx G E*) [|p(cc)| = \xf -\- r]. 
Henceforth, r will denote this value r. That this condition can be required follows 
easily from the standard “accepting-paths-based” proofs that P yf NP implies 
the existence of total 1-ary one-way functions. 

Define a total function p : E* ^ E* as follows: 

( lp{x) if (3x G A*) [a = la;] 

P(®) = \ a if (3s; G E* ) [a = Oa;] 

1 e if a = e. 



Note that p is a 1-ary, total one-way function satisfying that for each i > 0, 
p(0®) = 0*. Now define the total function a : E* x E* ^ E* as follows: 



a{a, b) = < 



l(p(x), if (3a;, y G E*) [jxj = \y\ A a = 0{x, y) = h\ 
l(p(x),0l*'l)if {3x,yGE*) 

[|a;| = \y\Aa = l{x,0y) Ab = l(a;, lp(?/))] 
l(p(x),0l*'l)if (3x,yGE*) 

[|a;| = \y\Aa = l(a;, lp(y)) Ab= l(a;, Oy)] 
0{a,b) otherwise. 



Clearly, a is polynomial-time computable, honest, s-honest, and commutative. 
If a were invertible, p would be too. Thus, tr is a one-way function. 

Note that a is strongly noninvertible, for if it could be inverted with respect 
to either argument then p could be inverted too. Suppose, for example, a were 
invertible with respect to the first argument via inverter gi G FP. Then p could 
be inverted as follows. Given any z G E* , if there is no A; G IN with fc’' -|- r = 
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|z|, there is no inverse of z under p; so, in that case we may output anything. 
Otherwise (i.e., there is a A: G IN with + r = |z|), run g\ on input (a,w), 
where a = 1(0^, Iz) and w = 1(0^, 0^). By the definition of ct, if z S image(p), 
the result of gi{{a,w)) must be of the form 1(0^, Oz) for some preimage z of z 
under p, and we can verify this by running p on input z and checking whether 
or not p(z) = z. A similar argument shows that a is not invertible with respect 
to the second argument. Hence, a is strongly one-way. 

Finally, we claim that cr is not overstrong. Here is what an inverter / does 
when given i = an alleged first argument a € E* of a, and an alleged 
output z € E* of a: 






' (x, y) if (3x, y e E*) [z = 0(x, y)] 

(a, a) if (3x, y G E*) (3m € IN) 

[a = Ox A z = l(2/,0™)] 

(0(w, w),0(w, w)) if (3w, x,y G A*) (3m S IN) 

[a = l(w,0x) A z = l(j/,0"*)] 
(0('u;, w),0{w, w)) if (3w, x,y G E*) (3m G IN) 

[a = l(u>, lx) A z = l{y, 0™)] 
^ (e, e) otherwise. 



Note that / G FP. Whenever there exists some string b G E* for which cr{a, b) = 
z, it holds that cr(/(l, a, z)) = z. (If there is no such b, it does not matter what 
/(I, a, z) outputs.) Hence cr is not overstrong. | 



Acknowledgments. We thank Osamu Watanabe for mentioning to us the no- 
tions, different from those used here though slightly reminiscent, from average- 
case theory, of claw-free collections, collision-free pseudorandom generators, and 
collision- free hash functions. We thank Chris Homan for suggesting overstrong- 
ness. 



References 



[BC93] 

[BDG95] 

[Ber77] 

[BHHR99] 

[GS88] 



D. Bovet and P. Grescenzi. Introduction to the Theory of Complexity. 
Prentice Hall, 1993. 

J. Balcazar, J. Diaz, and J. Gabarro. Structural Complexity I. EATCS 
Texts in Theoretical Computer Science. Springer- Verlag, second edition, 
1995. 

L. Berman. Polynomial Reducibilities and Complete Sets. PhD thesis, 
Cornell University, Ithaca, NY, 1977. 

A. Beygelzimer, L. Hemaspaandra, C. Homan, and J. Rothe. One-way 
functions in worst-case cryptography: Algebraic and security properties 
are on the house. SICACT News, 30(4):25-40, December 1999. 

J. Grollmann and A. Selman. Complexity measures for public-key cryp- 
tosystems. SIAM Journal on Computing, 17(2):309-335, 1988. 



® Since a is commutative, this implicitly also shows how to handle the case i = 2. 



If P 7 ^ NP then Some Strongly Noninvertible Functions Are Invertible 



171 



[HomOO] 

[HR99] 

[Ko85] 

[KRS88] 

[Pap94] 

[RS93] 

[RS97] 

[Sel92] 

[She86] 



C. Homan. Low ambiguity in strong, total, associative, one-way functions. 
Technical Report TR-734, University of Rochester, Department of Com- 
puter Science, Rochester, NY, August 2000. 

L. Hemaspaandra and J. Rothe. Creating strong, total, commutative, 
associative one-way functions from any one-way function in complexity 
theory. Journal of Computer and System Seiences, 58(3):648-659, 1999. 
K. Ko. On some natural complete operators. Theoretical Computer Sci- 
ence, 37(l):l-30, 1985. 

B. Kaliski Jr., R. Rivest, and A. Sherman. Is the data encryption standard 
a group? (Results of cycling experiments on DES). Journal of Cryptology, 
l(l):3-36, 1988. 

C. Papadimitriou. Computational Complexity. Addison- Wesley, 1994. 

M. Rabi and A. Sherman. Associative one-way functions: A new paradigm 
for secret-key agreement and digital signatures. Technical Report CS-TR- 
3183/UMIACS-TR-93-124, Department of Computer Science, University 
of Maryland, College Park, Maryland, 1993. 

M. Rabi and A. Sherman. An observation on associative one-way functions 
in complexity theory. Information Processing Letters, 64(2):239-244, 1997. 
A. Selman. A survey of one-way functions in complexity theory. Mathe- 
matical Systems Theory, 25(3):203-221, 1992. 

A. Sherman. Cryptology and VLSI (a Two-Part Dissertation). PhD 
thesis, MIT, Cambridge, MA, 1986. Available as Technical Report 
MIT/LCS/TR-381. 




Prediction-Preserving Reducibility 
with Membership Queries on Formal Languages* 

Kouichi Hirata^ and Hiroshi Sakamoto^ 

^ Department of Artificial Intelligence, Kyushu Institute of Technology 
Kawazu 680-4, lizuka 820-8502, Japan 
Tel: -^81-948-29-7622, Fax: -f81-948-29-7601 
hirataSai .kyutech. ac . jp 
^ Department of Informatics, Kyushu University 
Hakozaki 6-10-1, Fukuoka 812-8581, Japan 
Tel: -k81-92-642-2693. Fax: -f81-92-642-2698 
hiroshiSi .kyushu-u. ac . jp 



Abstract. This paper presents the prediction-preserving reducibility 
with membership queries {pwm-reducibility) on formal languages, in 
particular, simple CFGs and finite unions of regular pattern languages. 
For the former, we mainly show that DNF formulas are pwm-reducible 
to CFGs that is sequential or that contains at most one nonterminal. For 
the latter, we show that both bounded finite unions of regular pattern 
languages and unbounded finite unions of substring pattern languages 
are pwm-reducible to DFAs. 

Keywords: prediction-preserving reduction with membership queries, 
prediction with membership queries, context-free grammars, pattern lan- 
guages, grammatical inference, learning theory. 



1 Introduction 

The task of predicting the classification of a new example is frequently discussed 
from the viewpoints of both passive and active settings. In a passive setting, the 
examples are all chosen independently according to a fixed but unknown proba- 
bility distribution, and the learner has no control over selection of examples na 
CH]. In an active setting, on the other hand, the learner is allowed to ask about 
particular examples, that is, the learner makes membership queries, before the 
new example to predict is given to the learner m . 

Concerned with language learning, we can design a polynomial-time algo- 
rithm to predict deterministic finite automata (DFAs) in an active setting |^, 
while predicting DFAs is as hard as computing certain apparently hard crypto- 
graphic predicates in a passive setting M- Furthermore, predicting nondeter- 
ministic finite automata (NFAs) and context-free grammars (CFGs) is also hard 

* This work is partially supported by Japan Society for the Promotion of Science, 
Grant-in- Aid for Scientific Research (B) nos. 11558040, 13558029 and 13558036, and 
Grant-in- Aid for Encouragement of Young Scientists no. 12780233. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 1 72- TT^ 2001. 
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under the same cryptographic assumptions in an active setting 0 . Here, the 
cryptographic assumptions denote the intractability of inverting RSA encryp- 
tion, recognizing quadratic residues and factoring Blum integers. 

Pitt and Warmuth UHl have been formalized the model of a prediction and 
a reduction between two prediction problems that preserves polynomial-time 
predictability called the prediction-preserving reduction in a passive setting. An- 
gluin and Kharitonov have extended to the prediction and the reduction in an 
active setting. The reduction is called the prediction-preserving reduction with 
membership queries or a pwm-reduction for short. 

The prediction is an weaker learning model than PAC-learning or query learn- 
ing models jbibll bj : If a class is polynomial-time learnable with equivalence (and 
membership) queries, then it is polynomial-time PAC-learnable (with member- 
ship queries), and if a class is polynomial-time PAC-learnable (with membership 
queries), then it is polynomial-time predictable (with membership queries). 

Except the above general results, the detailed results of the pwm-reducibility 
on formal languages, for example, the restricted CFGs or another languages such 
as pattern languages, have few found elsewhere. Furthermore, many researchers 
have been interested in the pwm-reducibility on Boolean concepts but not on 
formal languages mmEi- Hence, in this paper, we pay our attention to the 
pwm-reducibility on simple CFGs and finite unions of regular pattern languages. 

For the former classes, we introduce several simple CFGs as follows: the linear 
grammars (/lunear), the right-linear grammars (Zlright-iinear)i and the left-linear 
grammars (£ieft-iinear) as usual; the k-bounded CFGs ^ (/Ifc-bounded-CFc) each 
of which right-hand side of productions contains at most k nonterminals; the se- 
quential CFGs [/WZ4\ (/IsqCFc) that the set of nonterminals has a partial order < 
such that T — >■ vUw iST < U for nonterminals T and U; the properly sequential 
or loop- free CFGs (/IpsqCFc) that is sequential but disallowing to occur the same 
nonterminal in both left- and right-hand sides for each production; the k-CFGs 
(jCfe-cFc) that contains at most k nonterminals; the parenthesis grammars m 
(^paren) that each production is of the form T — >■ [ic]. 

On the other hand, a pattern 0 is a string consisting of constant symbols 
and variables, a regular pattern [23 is a pattern in which each variable occurs 
at most once, and a substring pattern m is a regular pattern of the form xwy 
for a constant string w and variables x and y. The language of a pattern is the 
set of constant strings obtained by substituting nonempty constant strings for 
variables in the pattern. Then, for the latter classes, we deal with the bounded 
finite union of regular pattern languages by some constant m (/Iu^Rp) 
the unbounded finite union of regular or substring pattern languages (£urp or 

■^Usubp) ■ 

We denote that Li is pwm-reducible to £2 by £1 <pwm £2, and that 
£1 ^pwm £2 and £2 <pwm £1 by £1 =pwm £2- Then, this paper presents the 
results described as Fig. ^ Hence, we can conclude the following polynomial- 
time predictability with membership queries. 
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?Cnfa 


= pwm 


bright- linear 5 -^left-linear 


TInfa 


^pwm 


-^linear? -^fc-bounded-CFG 


.Cdnf 


^pwm 


-CaqCFG, -CpsqCFG, Ck-CFG 


>CuDFA ^pwm 


^paren 


-^UsubP ^pwm -^DFA 



Fig. 1. The pwm-reducibility 



1- .^linear? .^right-linear? .^left-linear? .^/c-bounded-CFG ^ 1) 3.nd /^paren IlOt 

polynomial-time predictable with membership queries under the crypto- 
graphic assumptions. 

2. If £sqCFG? ^CpsqCFG and £fc-cFG > 1) are polynomial-time predictable 
with membership queries, then so are DNF formulas. 

3. /lumRP (ni > 0) and £usubP are polynomial-time predictable with member- 
ship queries. 

Concerned with the statement 2, this paper corrects and extends some results 
obtained by the paper m- Furthermore, concerned with the statement 3, we 
can show that /Idnf ^pwm 'Curp, so if £urp is polynomial-time predictable with 
membership queries, then so are DNF formulas m- 

2 Preliminaries 

Let E and N be two non-empty finite sets of symbols such that E n N = $. A 
production A — >■ a on E and N is an association from a nonterminal A G N to 
a string a G {N U A)*. A context-free grammar {CFG, for short) is a 4-tuple 
{N, E, P, S), where S G N is the distinguished start symbol and P is a finite set 
of productions on E and N. Symbols in N are said to be nonterminals, while 
symbols in E terminals. Then, we deal with the following subclasses of CFGs. 

— A linear grammar is a CFG G = {N, E, P, S) such that each production 
in P is of the forms T — >■ wUv or T ^ w for T,U G N and w,v G E*. In 
particular, a right-linear {resp., left-linear) grammar if it is a linear grammar 
such that each production is of the forms either T ^ wU {resp., T ^ Uw) 
or T — >■ re for T,U G N and w G E*. 

— A CFG G = {N, E, P, S) is called k-bounded g] if the right-hand side of each 
production in P has at most k nonterminals. 

— A CFG G = {N, E, P, S) is called sequential f7l24| if the nonterminals in 

N are labeled S = Ti, . . . ,T„ such that, for each production Ti ^ w, w G 
{E U {Tj \ i < 3 < n})*. In particular, a sequential CFG satisfying that 
w G {E U {Tj I i < j < n})* for each production — >■ w is called properly 

sequential or loop-free. 

— A CFG G = {N, E, P, S) is called a k-CFG if |fV| < k. 

— A parenthesis grammar is a CFG G = {N, AU{[,]}, P, S) such that each 
production in P is of the form T -G [tc] for T G N and w G {N UA)*. 
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Let G be a CFG {N, S, S, P) and a and (3 be strings in {SVJN)* . We denote 
a =>G P if there exist 01,02 G (if U N)* such that o = 01X02, f3 = 01702 
and if — >■ 7 S P. We extend the relation to the reflexive and transitive 
closure =>q. For a nonterminal A G N, the language Lq{A) of A is the set 
{w G E* \ A w}. The language L{G) of G just refers to Lg{S). 

Next, we introduce the notions of patterns [ 2 j. Let X be a countable set of 
variables such that if fl X = 0 . A pattern is an element of (X U X)“*'. A pattern 
7T is called regular m if each variables in tt occurs at most once. In particular, 
a regular pattern of the form xwy is called a substring pattern m if x,y G X 
and w G 

A substitution is a homomorphism from patterns to patterns that maps 
each symbol a G E to itself. A substitution that maps some variables to an 
empty string e is called an e- substitution. In this paper, we do not deal with 
£-substitutions. By tt 9, we denote the image of a pattern by a substitution 
9. For a pattern tt, the pattern language L(n) is the set {w G E~^ \ w = 
tt 9 for some substitution 9}. 

3 Prediction with Membership Queries 

In this section, we introduce definitions and theorems for prediction and 
prediction-preserving reduction with membership queries due to Angluin and 
Kharitonov j 0 |. 

Let U denote E* . If u> is a string, |w| denotes its length. For each n > 0 , 
C/N = 

{w G [/ I |i/;| < n}. A representation of concepts C is any subset ofU xU. 
We interpret an element {u, w) oi U x U as consisting a concept representation 
u and an example w. The example w is a member of a concept u if (u,w) G C. 
Define the concept represented by u as kc{u) = {w | (u,w) G £}. The set of 
concepts represented by C is {kc:(m) | u G U}. 

To represent CFGs, we define the class £cfg as the set of pairs (u, w) such 
that u encodes a GFG G and w G L{G). Also we define the classes £imear, 

■^right-linear ; -linear? Gfe-bounded-CFG) tlseqGFGj •^psqCFGj ^k-CFG, and C paren? 

corresponding to linear grammars, right-linear grammars, left-linear grammars, 
fc-bounded GFGs, sequential GFGs, properly sequential GFGs, fc-GFGs, and 
parenthesis grammars, respectively, as similar. 

To represent finite unions of regular pattern languages, we define the class 
■CumRP as the set of pairs {u, w) such that u encodes m and a finite set tti, • • • , 
of TO regular patterns and w is in the concept represented hy c iS w G L^iTi) 
for at least one tt^. Similarly, we define the class £urp (resp., Xusubp) as the 
set of pairs {u, w) such that u encodes a finite set tti, • • • , tTj. of regular {resp., 
substring) patterns and w is in the concept represented by c iff w G LlyKi) for at 
least one tt^. Note that £umRP denotes the bounded finite unions, whereas £urp 
and GusubP denote the unbounded finite unions. 

Additionally, we introduce the following classes. The class £dfa {resp., £nfa) 
denotes the set of pairs {u, w) such that u encodes a DFA {resp., NFA) M and M 
accepts w. The class £udfa of finite union of DFAs denotes the set of pairs {u, w) 
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such that u encodes a finite set Mi , • • • , Mr of DFAs and w is in the concept 
represented by c iff at least one Mi accepts w. The class £dnf denotes the set 
of pairs {u, w) such that u encodes a positive integer n and a DNF formula d 
over n Boolean variables xi, - ■ ■ ,x„ such that \w\ = n {w = w\ ■ ■ ■ Wn) and the 
assignment Xi = Wi (1 < z < n) satisfies d. 

Angluin and Kharitonov have generalized the definitions of Pitt and War- 
muth of prediction algorithm m to allow membership queries as follows. 

Definition 1 (Angluin & Kharitonov |5]). A prediction with membership 
queries algorithm, or pwm-algorithm, is a possibly randomized algorithm A that 
takes as input n (a bound on the size of examples), s (a bound on the size of 
the target concept representations), and e (an accuracy bound). It may make 
three different kinds of oracle calls, the responses to which are determined by 
the unknown target concept c, and the unknown distribution D on 

1. A membership query PP takes a string w € U as input and returns 1 if 
w € Ci,; and 0 otherwise. 

2. A request for a random classified example takes as no input and returns a 
pair (w,b), where w is a string chosen independently according to D and 
6 = 1 if a; G c* and b — 0 otherwise. 

3. A request for an element to predict takes no input and returns a string w 
chosen independently according to D. 

A may make any number of membership queries or requests for random classified 
examples, whereas A must eventually make one and only one request for an 
element to predict and then eventually halt with an output 0 or 1 without 
making any further oracle calls. The output is interpreted as A’s guess of how 
the target concept classifies the element returned by the request for an element 
to predict. A runs in polynomial time if its running time (counting one step per 
oracle call) is bounded by a polynomial in n, s and 1/e. 

Definition 2 (Angluin &: Kharitonov [6]). Let £ be a representation of 
concepts and c* be the unknown target concept in C. We say that A successfully 
predicts C if, for each positive integer n and s, for each positive rational e, for 
each concept representation u € U^'^\ for each probability distribution D on 
[/["], when A is run with input n, s and e, and oracles determined by c* = nc{u) 
and D, A asks membership queries that are in U and the probability in at most 
e that the output of A is not equal to the correct classification of w by kc(u), 
where w is the string returned by the (unique) request for an element of predict. 

Definition 3 (Angluin & Kharitonov ^). A representation C of concepts 
is polynomial-time predictable with membership queries if there exists a pwm- 
algorithm A that runs in polynomial time and successfully predicts C. 

It is well known the following statements: 

1. £dfa is polynomial-time predictable with membership queries P|. 
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2. £udfa, >Cnfa o.nd £cfg “t'e not polynomial-time predictable with member- 
ship queries under the cryptographic assumptions 0. 

3. £dnf is either polynomial-time predictable or not polynomial-time pre- 
dictable with membership queries, if there exist one-way functions that cannot 
be inverted by polynomial- sized circuits jS|. 



Definition 4 (Angluin & Kharitonov |S]). Let Ci be a representation of con- 
cepts over domain Ui (i = 1,2). We say that predicting L\ reduces to predicting 
£2 with membership queries {pwm-reduces , for short), denoted by Li <pwm ^^2, 
if there exist an instance mapping f : ISS x N x Ui ^ U2, & concept mapping 
g : N X N X £1 — >■ £2, and a query mapping h : N x N x U 2 ^ Ui U {T, _L} 
satisfying the following conditions. 

1 . For each x € and u G £j*^, x G kcj{u) iff f{n,s,x) G nc^igin, s,u)). 

2 . / is computable in time bounded by a polynomial in n, s and \x\. 

3 . The size of g{n, s, u) is bounded by a polynomial in n, s and |m|. 

4 . For each x' G U2 and u G £!j^^, if h{n,s,x') = T then x' G Kc^igin, s,u))] if 
h{n, s, x') = _L then x' ^ ^£2 [g{n, s, u)); if h{n, s, x') = x € Ui, then it holds 
that x' G KC 2 {gin, s, u)) iff x G (u). 

5 . h is computable in time bounded by a polynomial in n, s and |x'|. 

Furthermore, we denote that £1 <pwm ^2 and £2 ^pwm ^1 by £1 =pwm J^ 2 - 
The following theorem is useful for showing the predictability or the hardness 
of predictability of the representations of concepts. 

Theorem 1 (Angluin Kharitonov JR]). Let L\ and £2 be representations 
of concepts and suppose that £1 <pwm ^ 2 - 

1. If C 2 'Is polynomial-time predictable with membership queries, then so is L\. 

2. If Cl is not polynomial-time predictable with membership queries, then nei- 
ther is £2. 

4 Prediction-Preserving Reducibility with Membership 
Queries 

In this section, we fix /, g and h to an instance mapping, a concept mapping, 
and a query mapping. Furthermore, the parameters n and s denote the bounds 
of examples and representations, respectively. For simplicity, we assume that the 
length of examples for Boolean concepts is always fixed to the upper bound n. 

4.1 Simple CFGs 

First of all, by using the transformation from a DFA to a right-linear grammar 
(c/. it holds that £DFA^pwm£right-iinear, because the size of the right-linear 

grammar is bounded by a polynomial in the size of a DFA. Note that the converse 
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direction £right-iinear ^pwm >Cdfa do6S not follows from the transformation from 
a right-linear grammar to a DFA, because the size of the DFA is not bounded 
by a polynomial in the size of a right-linear grammar in general. 

Furthermore, we point out that £dnf ^pwm^CuDFA) while £dnf ^^dfa fTRj - 
Here, the < means the prediction-preserving reduction without membership 
queries introduced by Pitt and Warmuth CHI, that is, there exist / and g sat- 
isfying the requirement from 1 to 3 in Definition ^ Note that we cannot apply 
the same proof of £dnf ^ >Cdfa to proving £dnf ^pwm >Cdfa; In this case, we 
cannot construct h correctly. 

On the other hand, by regarding the equivalent transformation between an 
NFA and a right-linear grammar [SIDj as a concept mapping g, we observe that 
■^NFA =pwm -^right-linear- Furthermore, for a CFG G = {N, A, P, S), let be a 
CFG {N, A, P', S) such that T — >• G P' for each T ^ w G P. Here, R denotes 

the reversal of a word. Then, for a right-linear (resp., left-linear) grammar G, 
construct the following /, g and h: 

f{n,s,e) = e^, 
g(n,s,G) = G^, 
h{n, s, e') = e'^. 



It is obvious that ifright-linear ^pwm -^left-linear {T6Sp.^ -^left-linear ^pwm -^right-linear) , 
SO it holds that /Iright-iinear — pwm ^left-linear- Similarly, we also observe that 
-^NFA ^pwm -^linear and £nfa ^pwm -£fc-bounded-CFG for each fc > 1. Summary: 

Theorem 2. £ for £ G {£right -linear, £left -linear}- A/sO, £nFA ^pwm 

£linear UTfd £]SjpA ^pwm £A:-bounded-CFG Gdch k^\. 



Theorem 3. £dnf ^pwm £ for £ G {£psqGFG, £sqCFGl- 



Proof. Let d be a DNF formula ti V - - - V over n Boolean variables x \ , 
First, we define wl (1 < i < n, 1 < _) < m) as follows: 



W-' 



1 if tj contains Xi, 
0 if tj contains aii, 
T otherwise. 



Xn- 



Then, construct /, g and h as follows: 
y(?7-, s, e) ^ e, 

g(n, s, d) = ({S', Tj, (0, 1}, S, (S ic} • • • w), | • • • | w)" • • - w™, T -)> 0 | 1}), 
h{n, s, e') = e' . 



Note that g{n,s,d) is a properly sequential CFG. It is obvious that the above 
/, g and h satisfy the conditions of Definition 0 □ 



Theorem 4. For each k >1, £dnf ^pwm £fc-CFG- 



Prediction-Preserving Reducibility 179 



Proof. Theorem 0 implies that £dnf ^pwm J^k-CFG for each k > 2, so it is 
sufficient to show that £DNF^pwm>Ci-cFG- Let d = tiV- ■ -Vim be a DNF formula 
over n Boolean variables xi, . . . , First, define wf (1 < i < n, 1 < j < m) as 
follows: 

{ 1 if tj contains Xi, 

0 if tj contains xf, 

S otherwise. 

Then, construct /, g and h as follows: 

f{n,s,e) = e, 
g{n,s,d) = ({S'}, {0,1}, 5, 

n+1 2n 

( e' if |e'| = n, 

h{n, s, e') = < T if 1 < |e'| < n, 

[ T if |e'| = 1 or \e'\ > n. 

For each e G {0,1}”, it holds that e satisfies d iff S ^ f{n,s,e). 

Furthermore, for each e' G {0,1}*, if h{n,s,e') = T, then S s d) 
cause g{n, s, d) generates no strings of length more than 1 and less than n; If 
h{n, s, e') = e', then it holds that S =>*(„ ^ e' iff h{n, s, e') satisfies d. 

Finally, consider the case that h{n, s, e') = T. It is sufficient to show that, for 
each fc > 1, it holds that S =>*(„ ^ S • — S for each to (1 < m < n). If A: = 1, 

fcn+m 

then, by the definition, it holds that S =>*(„ ^ S • — S for each to (1 < to < n). 

n+m 

Suppose that, for some fc > 1, S ^ d) jS* ■ — S for each to (1 < to < n). Then, 
it holds that S S • • • S S -S' • • • S S • • • S = S • • -_S for 

/cn+(m — 1) /cn+(m— 1) n+1 (fe+l)n+m 

each TO (1 < TO < n). Hence, g{n,s,d) generates all strings of length more than 
n, so if h{n, s, e') = T, then S ^*(„ e'. □ 

Theorem 5. TuDFA ^pwm Af paren ■ 

Proof. Let Mi, . . . , Mr be DFAs with the same alphabet E {[,] ^ S) and with 
mutually distinct states. For each Mi = (Qi,E,Si,ql,Fi) {I < i < r), con- 
struct a parenthesis grammar Gi{n,s,Mi) = {Qi,SU {[,]}, Pj,gg) such that 
q — >■ [aSi{q, a)] G Pi for each q € Qi and a € S; q ^ [e] for each q G +, where e 
is an empty string. By using Gi{n,s,Mi), let PMi,...,Mr be the following set of 
productions for S ^ {Ui<i<rQi) U A U {[,]}: 

PMi,...,Mr = {S ^ [(?g] I ■ ■ • I [<Zo]} U {Dl<i<rPi). 

Then, construct /, g and h as follows: 
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f{n,s,eie 2 ---ei) = [[ei[e 2 [- • • [e;[e]] • • •]]]] for e* e S, 
g{n, s, {Ml, Mr}) = ({S'} U {'Jl<^<rQi), S U {[,]}, S, 

, _ Jei-'-e/ if e' = [[ei[e 2 [- • • [e/[e]] • • •]]]] and € T, 

n[n,s,e)-<^ ± otherwise. 

Note that L{g{n,s,{Mi, . . . ,Mr})) C {[[ei[e 2 [- • • [e^le]] • ■ ■]]]] \ m>l, a G 
S}, so if h{n,s,e') = _L, then S 7 ^g(„ ^ {Mi Mr}) obvious that 

'5' ^*g{u,s,{M,,...,Mr}) [[ei[e 2 [- ■ • [ei[e]] ■ ■ •]]]] iff dCa • • • e/ G L{M,) for some i (1 < 
i < r). Hence, it holds that £udfa ^pwm >Cparen- □ 

Sakakibara m has shown that £paren is polynomial-time learnable with 
membership and equivalence queries if the structural information is available. 
Furthermore, Sakamoto m has shown that £paren is polynomial-time learnable 
with membership queries and characteristic examples. The above theorem claims 
that the structural information or the characteristic examples are essential for 
efficient learning of £paren- 

4.2 Finite Unions of Regular Pattern Languages 

Since each regular pattern language is regular |2S|, we can construct a DFA Mt^ 
such that L{Mtt) = L(tt) for each regular pattern tt as follows: Suppose that 
7T is a regular pattern of the form tt = XQaiXia 2 ■ ■ ■ Xn-iUnXn, where Xi G X 
and ai = G Then, the corresponding DFA for tt is the DFA 

Mtt = (A, Q, S, go, F) such that: 

1- Q = {90,p{,---,Pmi,9l,Pl,---,Pm2>92,---,gn-l,P?,---,Pm„,gn} and F = 

{ } , 

2. S{qi, a) = p\^^ and S{qn, a) = qn for each a G E and 0 < i < n — 1, 

3. 6{pj,aj) = Pj+i and S{pln . , amj = for each 1 < i < n and 1 < j < — 1, 

4. S{p},a) = p\ for each a G E such that a}. 

It is obvious that \Mt^\ is bounded by a polynomial in |7 t|. 

By using the corresponding DFAs, we can easily shown that £rp <pwm ^dfa 
by constructing the following /, g and h for each regular pattern tt: 

f{n,s,e) = e, 
g{n,s,TT) = M„, 
h{n, s, e') = e' . 

Then, Arp is polynomial-time predictable with membership queries, which is im- 
plied by the result of Matsumoto and Shinohara uni that £rp is polynomial-time 
learnable with equivalence and membership queries. Furthermore, the following 
theorem holds: 

Theorem 6. For each m> 0, Tu^RP ^pwm ^dfa- 

Proof. Let TTi,...,TTm be m regular patterns. Also let = (Qi^ E,Si, q^, Fi) 
be the corresponding DFA of tt^. First, construct a DFA = {Qi x • • • x 

Qm,E,S, (g^, . . .,q{f),Fi x • • • x F^) such that i5((gi, . . . ,g™),a) = (pi, . . . ,pm) 
iff 5i{qi, a) = Pi for each i {1 < i < m). Then, construct /, g and h as follows: 
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/(n,s,e) = e, 

^(7Z,S,{7Ti, . . . , TTm}) = ,...,7r^ 5 

h{n, s, e') = e'. 

Recall that the size of g{n, s, {tti, . . . ,TTm}) is bounded by a polynomial in 
s, that is, 0(s”^). It is obvious that L(7Ti) U • • • U L{-k^) = which 

implies that /Iu^rp <pwm >Cdfa- □ 

For the unbounded finite unions of regular pattern languages, the following 
theorem holds from our previous result. 

Theorem 7 (Sakamoto et al. |22j). £dnf ^pwm^uRP- 

The idea of the proof of Theorem 0 in 122 ] is similar as one of Theorem 0 First, 
for each term tj (1 < j < m) in a DNF formula d = ti V • • • V over x\, . . . ,x^, 
construct a regular pattern ttj = wf ■ ■ ■ tt^ as follows: 



Furthermore, let tt be a regular pattern xi ■ ■ ■ XnXn+i- Then, we can construct 
/ and h as same as Theorem^ and g as g(n, s, d) = {tti, . . . , 7r,„, tt}. Note that 
L(7 t) = S* — because of disallowing ^-substitutions. 

For the unbounded finite unions of substring pattern languages, we obtain 
the following theorem. 

Theorem 8, TusubP ^pwra-^DFA- 

Proof. Let 7Ti,...,7rj. be substring patterns such that tt^ = XiWiyi. For a set 
{ici, . . . , Wr} of constant strings, consider the following modification of a pattern 
matching machine (pmm) 0. The goto function is defined as same 

as a pmm. For the state j transitioned from the right-most constant of Wi by 
the goto function, the failure function maps j to j itself; otherwise, the failure 
function is defined as same as a pmm. The output function is not necessary. 
Since this modified pmm is also a DFA, construct /, g and h as follows: 



Note that the size of g{n, s, {tti, . . . , 7Tr}) is 0(|wi| H h |wr|) Q- Further- 
more, it is obvious that L{g{n, s, {tti, . . . , = L(7Ti) U • • • U L{TTr), which 



Shinohara and Arimura PS| have discussed the inferability of TumRP> ^urp 
and TusubP in the framework of inductive inference. They have shown that 
£umRP ^usubP are inferable from positive data in the limit, whereas Turp is 
not. In contrast, by Theorem 0 Q and 0 Tu„rp and TusubP are polynomial-time 
predictable with membership queries, whereas £urp is not if neither are DNF 
formulas. 




/(n,s,e) = e. 



g{n,s,{TTi,...,nr}) = M, 
/i(n, 5, e') = e^ 



implies that /lusubP ^pwm -^dfa- 



'pwm 



□ 
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5 Conclusion 



In this paper, we have presented the results of pwm-reducibility on formal lan- 
guages described as Fig. 0 in Section 0 

The results in Section EH tell us that the efficient predictability of CFGs 
may be necessary to assume some deterministic application for productions. 
Ishizaka uni showed that the simple deterministic grammars are polynomial- 
time learnable with extended equivalence and membership queries. The extended 
equivalence query can check whether or not the hypothesis is generated by simple 
deterministic grammars. It remains open whether or not simple deterministic 
grammars are polynomial-time predictable with membership queries. 

Angluin 0 showed that £fc-bounded-CFG is polynomial-time predictable with 
nonterminal membership queries, and Sakakibara m extended Angluin’s re- 
sult to extended simple formal systems. In our previous work, we extended the 
pwm-reduction to prediction-preserving reduction with nonterminal membership 
queries partially m- It is important to investigate the properties of it in more 
detail. 

In Section 14.21 we only deal with the finite unions of regular pattern lan- 
guages. Many researchers such as j2liili;ill4llt)liYi:^ have developed the learn- 
ability/predictability of non-regular pattern languages. In particular, the learn- 
ability of the languages of k-variable patterns mm that contain at most k vari- 
ables, kpL-patterns im each of which variable occurs at most k times, and erasing 
patterns fTTFffi) that allow e-substitutions have been widely studied in the vari- 
ous learning frameworks. It is a future work to investigate the pwm-reducibility 
of them or their finite unions. 
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Abstract. In this paper dense families of database relations are 
introduced. We characterize dependencies and keys of relation instances 
in terms of dense families. Key functions are also introduced. They are 
isotone Boolean functions, which minimal true vectors are the charac- 
teristic vectors of the keys. We show that each dense family determines 
an isotone Boolean function which is the dual of the key function. We 
also show how to determine for a given relation r an r-dense family 
which size is at most m^/2— m/2-|-l, where m is the number of tuples in r. 

Keywords: Armstrong system; Dense family; Functional dependency; 
Key; Key function; Matrix representation. 



1 Introduction 

We begin with recalling some notions and results concerning relational databases, 
which can be found in j^, for example. A relation schema R{A) consists of a 
relation name R and a set of attributes A = {oi,...,a„}. The domain of an 
attribute a G A is denoted by dom(a). It is also assumed that the attributes in 
A have some certain fixed order. 

A relation r of the relation schema R{A) is a set of n-tuples r = {ti , . . . ,tm}- 
Each n-tuple t G r is an ordered list of values t = {vi, . . . , u„), where every value 
Vi, 1 < i < n, is a, member of dom(ai). 

Let R{A) be a relational schema and let r be a relation of the schema R{A). 
We denote by t[a] the value of attribute a G A in a tuple t G r. If B — {bi, . . . , bk} 
is a subset of A ordered by the order induced by the order of A, then t[B] denotes 
the ordered list of values {t[bi ], . . . , t[bk])- 

The concept of functional dependency between sets of attributes was intro- 
duced by Armstrong p. For any subsets B and C of A, a functional dependency , 
denoted by B — >■ C, is defined by the following condition: 

B ^ C ii and only if (Vti,t2 G r) ti[B] = t2[B] ti[C] = t2[C]. 

This means that the values of the B component of tuples uniquely determine 
the values of the C component. We denote by the set of all functional depen- 
dencies that hold in a relation r. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. ISd-Q^l 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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For a relation r of a schema a set of attributes i? C A is a superkey 

oi r ii B ^ A G Fr- A key is a superkey with the property that removal of any 
attribute from C will cause C not to be a superkey any more. By definition, 
each tuple of a relation must be distinct. Thus, the values of a key can be used 
to identify tuples. Note that a relation may have several keys. 

Example 1. Let us consider the relation schema R{A), where the attribute set 
is A = {SSN, LNAME, FNAME, DEPT, AGE}. In Figure 1 is given a relation r 
of this schema. 



SSN 


LNAME 


FNAME 


DEPT 


AGE 


422-11-2320 

533-69-1238 

489-22-1100 

305-61-2345 


Benson 

Ashly 

Benson 

Davidson 


Barbara 

Dick 

Mary 

Dick 


Mathematics 
Computer Science 
Mathematics 
Computer Science 


Young 

Old 

Middle-aged 

Old 



Fig. 1. Relation r 



It is obvious that {SSN} is a key of r since two persons cannot have the same 
value for SSN. Note that (LNAME, FNAME} is also a key of r. 

In practice, functional dependencies and keys are viewed as properties of a 
schema] they should hold in every relation of that schema. Furthermore, keys 
and dependencies are usually specified by database designers. But as in the 
previous example, relations may have additional keys which designers are not 
necessarily aware of. Here we study dependencies and keys of a given relation 
instance. Especially, we consider the problem of checking the possible existence 
of unknown keys. This can be viewed as a data mining problem since our goal 
is to find new properties of data instances. Note that the problem of deciding 
whether there exists a key of cardinality at most k is NP-complete (see e.g. |B|). 

Moreover, the worst-case number of keys is ^ where n is 

the number of attributes (see e.g. where further references can be found). 

In this paper we introduce the notion of dense families. An r-dense family is 
a collection of subsets of A, which by applying certain condition induces the set 
Ft. We characterize dependencies and keys in terms of dense families, and show 
that generating all keys of a relation instance r can be reduced to generating all 
minimal transversals of a simple hypergraph minjA^ \ X G S,X A}, where 
S is any r-dense family. 

The author introduced in 0 matrices of preimage relations and here we 
present a similar matrix representation for database relations. We show that the 
family consisting of the entries of the matrix of a database relation r forms an 
r-dense family which size is at most rnf /2 — m/2 -|- 1, where m is the number of 
tuples in r. 

We introduce key functions which are isotone Boolean functions such that 
their minimal true vectors are the characteristic vectors of the keys. Note that 
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key functions are somewhat similar to the discernibility functions of information 
systems defined in We show that each r-dense family determines an isotone 
Boolean function which is the dual of the key function. This implies that the 
problem of checking whether there exists an unknown key can be reduced to 
problem of deciding whether two isotone Boolean functions are mutually dual. 
Furthermore, the problem of enumerating the set of all keys of a relation instance 
can be reduced to the problem of finding the dual of an isotone Boolean function. 

Note that in Fredman and Khachiyan showed that for a pair of isotone 
Boolean functions / and g given by their minimal true vectors minT(/) and 
mmT{g), respectively, we can test in time whether / and g are mutually 

dual, where k = |minT(/)| + |minT((;)|. The above result implies also that 
for an isotone Boolean function / given by its minimal true vectors and for a 
subset G C minT(/‘^), a new vector v G — G can be computed in time 

nfc°(i°g^), where fc = | minr(/)| + |G| (see e.g. for further details) . This means 
also that for any isotone Boolean function / given by its minimal true vectors, 
f‘^ can be computed in time where k = |minT(/)| + |minT(/‘^)|. 

Our paper is structured as follows. In the next section we give some funda- 
mental properties of Armstrong systems. In particular, we show that each family 
of subsets of A induces an Armstrong system. Based on this observation we in- 
troduce dense families of a relation. In Section 3 we study matrix representations 
of relation instances. The final section is devoted to key functions. 

We also note that some proofs are safely left as exercises for the reader. 

2 Armstrong Systems and Dense Families 

Let r be a relation of a schema R{A). In the following we denote Fr simply by F, 
that is, F is the set of all functional dependencies that hold in r. It is well-known 
(see 0, for example) that the following conditions hold for all B,C,D A: 

(Al) if B D G, then B ^ C G F- 

(A2) if B G e F, then (B U B) (G U B) e B; 

(A3) if B G e F and G ^ B e B, then B ^ D G F. 

Conditions (A1)-(A3) are called Armstrong axioms. 

Let A be a set and let F be a set of pairs B ^ G, where B,G C A. We 
say that the pair (A, F) is an Armstrong system, if A is finite and F satisfies 
conditions (A1)-(A3). Next we present some simple properties of Armstrong 
systems. 

Let {A, F) be an Armstrong system. We denote by B+ the union of all sets, 
which are dependent on B, that is, 

B+ = 1J{G C A I B ^ G e F}. 

It follows easily from Armstrong axioms that B+ is the greatest set dependent 
on B. It is also easy to verify that for all B, G C A: 

(Cl) B C B+; 

(C2) BCG implies B+ C G+; 

(C3) (B+)+ = B+. 
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Conditions (C1)-(C3) mean that the map p{A) — >• p{A),B H> i?+, is a closure 
operator. Here p{A) denotes the power set of A, that is, the set of all subsets of 
A. Furthermore, the family 



Cf = {B+ \ BCA} 



is a closure system. This means that is closed under arbitrary intersections, 
that is, for all T~L C Cp. Note that = X for all X G Cp. It is also 

obvious that 

B^ C e F C+ C B+. 

Let S C p{A) be a family of subsets of a A. We define a set Fs of pairs of 
subsets of A by the following condition: 



Fs = {B C l(VX € S) B C X ^ C C X}. 



Proposition 1. IfS is a family of subsets of a finite set A, then the pair {A, Fs) 
is an Armstrong system. 

In the previous proposition we showed that each collection of subsets of a 
finite set induces an Armstrong system. Next we introduce dense families of a 
database relation. 

Definition 1. Let R{A) be a relation schema and let r be a relation of that 
schema. We say that a family S C p{A) of attribute sets is r-dense (or dense in 
r) if F, = Fs. 

The problem is how to find dense families. Our next proposition guarantees 
the existence of at least one dense family. Recall that each Armstrong system 
{A^F) determines a closure system Cp. In the sequel we denote the closure 
system >C(Fr) simply by Lr. 

Proposition 2. The family Cr is r-dense. 

Next we present some fundamental properties of dense families. 

Proposition 3. If S is r-dense, then the following conditions hold for all 
B,C CA. 

(a) S C 

(b) B+ = f|{A eS\B<ZX}. 

(c) B^Cg Fr if and only if (yx G S) B C X ^ C C X . 



Remark 1. Note that by Propositions 2 and 3(a), Cr is the greatest r-dense 
family. Furthermore, Proposition 3(b) implies that each r-dense family generates 
the family Cr. 
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We end this section by studying keys of a relation by means of dense families. 
Recall that a set of attributes i? C A is a superkey oi r \i B ^ A & Fr, and a 
key is a minimal superkey. 

For any B C A, we denote by B^ the eomplement of B with respect to the 
set j 4, that is, = {a G A | a ^ B}. Let S be r-dense. It is clear that the 
following conditions are equivalent for all X G 5 and B C A: 

(Kl) i? is a superkey of r; 

(K2) B^ AGFr; 

(K3) B CX ^ X = A; 

(K4) X ^ A^ B %X; 

(K5) 

Note that the equivalence of (K2) and (K3) follows from Proposition 3(c). Since 
(Kl) and (K5) are equivalent, we can give our characterization of keys and 
superkeys in terms of dense families. 

Theorem 1. Let R{A) be a relation schema and let r he a relation of that 
schema. If S is r-dense, then the following conditions hold. 

(a) B is a superkey of r if and only if it contains an element from each set in 
{X^ \ X eS,X ^ A}. 

(b) B is a key of r if and only if it is minimal with respect to the property of 
containing an element from each set in \ X G S, X ^ A}. 



Remark 2. Note that an element a G A belongs to all keys if X^ = {a} for some 
X G S, where S is an r-dense family. 

A family R is of subsets of A is a simple hypergraph on A, if X C K implies 
X = Y for all X, y G R. A transversal B of a simple hypergraph R is a subset 
intersecting all the sets of R, that is, B fl X yf 0 for all X G B. Transversals are 
also called hitting sets. By Theorem 1(b) it is now clear that a set B is a key of 
r iff B is a minimal transversal of the hypergraph min{X^ | X G 5, X A}, 
where S is any r-dense family. 

The following algorithm, which can be found in 0, computes all minimal 
transversals of a hypergraph R in time 0((njfGW 1^1)^)- 

Algorithm Minimal Transversals. 

Input. A simple hypergraph R on A. 

Output. The family F of minimal transversals of R. 

r^{0}; 

for each X G R do 

T ■<— {y U {a:} I y G T and x G X}; 
r^{YGT\(yZGT) Z^Y}. 

In Section 4 we will present an another method for enumerating the keys. 
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3 Matrix Representation 

In this section we study certain matrices defined by database relations. We show 
that the set consisting of the entries of the matrix of a database relation r forms 
an r-dense family. 

Definition 2. Let R{A) be a relation schema and let r = {fi, . . . , tm} be a rela- 
tion of that schema. The matrix of the relation r, denoted by M(r) = {cij)mxm, 
is defined by 

Cij = {a€ A\U[a]= tj[a]}. 

The above definition means that the entry Cij is the set of all attributes a 
such that the tuples ti and tj have the same a-value. 

Now our following proposition holds. 

Proposition 4. If R{A) is a relation schema, r is a relation of that schema, 
and M{r) = {cij)mxm is the matrix of r, then the family 



{cij \ I < i,j < m} 



is r-dense. 

Proof Assume that r = {ti, . . . ,tm}- Let us denote S = {cij | 1 < i, J < m}. 
We show that Fj. = Fg. 

Suppose that B ^ C £ This means that for all tuples ti,tj £ r, ti[B] = 
tj[B] implies ti[C] = tj[C]. Let £ S and assume that B C dj. This means 
that ti[B] = tj[B], and so ti[C] = tj[C] by the assumption B ^ C £ Fr- Hence, 
C C dj and we obtain B ^ C £ Fg. 

On the other hand, let B ^ C £ Fg. Assume that ti[B] = tj[B] for some 
1 < i,j < This means that B C dj- But B ^ C £ Fg implies C C dj. 
Hence, also tJC] = tj[C] must hold and thus B ^ C £ Fr. □ 

It is easy to see that the r-dense family consisting of the entries of M (r) has 
at most m^/2 — m/2 -h 1 elements. 

Example 2. Let us consider the relation r presented in Fig. 1. We denote the 
attributes SSN, LNAME, FNAME, DEPT, and AGE simply by the numbers 1, 
2, 3, 4, and 5, respectively. Moreover, we denote the subsets of A which differ 
from 0 and A simply by sequences of numbers. For example, {1, 2, 3} is written 
as 123. 

The matrix M{r) is the following: 

/A 0 24 0 \ 

0 A 0 345 

24 0 A 0 

V 0 345 0 A / 

By Proposition 4 the family S = {0, 24, 345, A} consisting of the entries of M(r) 
is r-dense. For example, the dependency 23 — >■ A holds, since 23 and A are 
included in the same sets of S. 
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It is also easy to form the set Cr = {0, 4, 24, 345, A}. This is the closure 
system generated by the family S. 

Let us denote T = \ X G S,X A} = {A, 135, 12}. Obviously, 1, 

23, and 25 are the minimal sets with respect to inclusion which contain an 
element from each set in family T . This means by Theorem 1(b) that {SSN} 
and {LNAME, FNAMEj are keys of r, as we already mentioned. But we have 
found also a new key of r, namely {LNAME, AGE}. 



4 Key Functions 

In this section we consider key functions. First we recall some basic notions 
concerning Boolean functions (see e.g. j2), where further references can be found). 
A Boolean function is a map /: (0, 1}" — >■ (0, 1}. An element v G (0, 1}" is called 
a Boolean vector. If f{v) = 1 (resp. f{v) = 0), then v is called a true (resp. false) 
vector of /. The set of all true (resp. false) vectors of / is denoted by T{f) (resp. 

F{f))- 

Let A = (oi, . . . ,On} and B C A. The characteristic vector y(i?) of i? is a 
vector (ui, . . . , u„) such that Vi = 1 li Oi G B and = 0 if ^ B. 

Definition 3. Let R{A) be a relation schema, where A = |ai,...,a„}, and 
let r be a relation of that schema. The key function of r is a Boolean function 
fr- (0, 1}" — >■ (0, 1} such that for all B <G A, 

x{B) G T{fr) B is a superkey of r. 

Next we present some properties of key functions. Let u = (ui, . . . , m„) and 
V = (vi, . . . , Vn). We set u < v if and only if Ui < Vi for all 1 < i < n. A Boolean 
function / is isotone if u < v always implies f{u) < f{v). 

Let /: (0, 1}" — )> (0, 1} be an isotone Boolean function. A true vector v of 
/ is minimal if there is no true vector v' of / such that v' < v. We denote by 
minT(/) the set of all minimal true vectors. A maximal false vector and the set 
maxF(/) are defined analogously. 

Now the following obvious proposition holds. 

Proposition 5. Let R(A) be a relation schema, where A = |ai,...,a„}, and 
let r be a relation of that schema. The key function fr- (0, 1}" — >■ (0, 1} of r is 
an isotone Boolean function such that minT(/r) = |x(B) \ B is a key of r}. 

If / is an isotone Boolean function it is known that there is one-to-one corre- 
spondence between prime implicants and minimal true vectors. For example, an 
isotone Boolean function /: {0, 1}^ — ?> (0, 1}, / = 12 V 23 V 31, where 12 stands 
for xi A X 2 and so on, has prime implicants 12, 23 and 31, which correspond to 
minimal true vectors 110, Oil, 101, respectively. 

By Proposition 5, the minimal true vectors of a key function are exactly the 
characteristic vectors of keys. The problem is how to find the key function. It 
turns out that each dense family defines an isotone Boolean function, which is 
the dual of the key function. 
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The dual of a Boolean function /, denoted by is defined by = f{x), 

where / and x denote the complements of / and x, respectively. It is known 
that = / and that the DNF expression of is obtained from that of / 

by exchanging V and A as well as constants 0 and 1, and then expanding the 
resulting formula. For example the dual of / = 3 V 14 V 24 is /"^ = 3 A (1 V 4) A 
(2 V 4) = 34 V 123. It is well-known and obvious that for any isotone Boolean 
function /, 

minT(/) = {u | u G maxF'(/‘^)}. 

Let R{A) be a relation schema, where A = {ai,...,a„}, and let r be a 
relation of that schema. For any r-dense family of subsets S of r, we define an 
isotone function gg: {0, 1}” — >■ {0, 1} by setting 

min(55) = min{x(X^) \ X € S,X ^ A}. (4.1) 

Our next proposition shows that gg is the dual of the key function. 

Proposition 6. Let R{A) be a relation schema and let r he a relation of that 
schema. For all r-dense families S, 

fr = (gsf- 

Proof Let A = {ai, . . . , a„}. We will show that for all v G {0, 1}”, 

veT{fr) v&F{gs), 

which means that fr = {gs)‘^- 

Suppose that x{B) G T(/r) and assume that x(B) ^ F{gs). This means 
that x{B) G T{gs). Then by the definition of gs, there exists an X G 5 such 
that X ^ A and x(^^) < x{B) = x{B^)- This is equivalent to X^ C 
and B Q X. Since conditions (Kl) and (K4) are equivalent, this means that 
B ^ A ^ Fr- Thus, B is not a superkey of r and x{B) ^ T{fr), a contradiction! 
Hence, x(B) G F{gs). 

On the other hand, suppose that x{B) G F{fr). Because x{B) is a false 
vector oi fr, B ^ A ^ Fr, and hence there exists an X G 5 such that X ^ A 
and B C X. This implies that X*" C B^ and x(^^) ^ x(B^) = x(H). Because 

niin(g5) = min{x(X^) \ X € S,X ^ A}, 

we obtain x{B) G T{gs). □ 

Example 3. Let us continue Example 2. As we already mentioned, the family 
S = {0, 24, 345, A} consisting of the entries of M(r) is r-dense. By the definition 
of gs, 

mmT{gs) = min{x(A:'^) \X gS,X A} = {10101,11000}. 

The Boolean function g^: {0, 1}^ — >■ (0, 1} has a DNF expression 135 V 12, where 
12 stands for xi A X 2 , and so on. This implies that 

fr = {gsf = (1 V 3 V 5) A (1 V 2) = 1 V 23 V 25. 
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Hence, the key function fr has minimal true vectors 10000 and 01100, and 01001. 
These vectors are the characteristic vectors of {SSN}, {LNAME, FNAME}, and 
{LNAME, AGE}, respectively, and thus these sets are the keys of r. 

Conclusions. We studied dense families of relation instances and characterized 
dependencies and keys in terms of dense families. We studied certain matrices 
of database relations and showed that the family consisting of the entries of this 
kind of matrix is dense. We also showed how the problem of checking the possible 
existence of unknown keys can be reduced to the problem of deciding whether 
two isotone Boolean functions are mutually dual. 

A set of functional dependencies A is a cover of a relation r, if the set of 
functional dependencies which can be inferred from E by applying Armstrong 
axioms (AS1)-(AS3) equals E}. Obviously, we can use dense families to solving 
the problem of determining a cover of a relation instance. Namely, the set Cr 
can be generated by any r-dense family by using the operation intersection. We 
can also compute for any X G Cr the set key(A) of all keys of the relation 
r' , where r' is the relation r restricted to the schema R{X). Furthermore, if 
Lr = {Xi, , Xp}, then apparently the set 

E = {Bi -G- Xi \ Bi G key(Ai)} U • • • U {Bp -G Xp \ Bp G key(Ap)} 
is a cover of r. 
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Abstract. We investigate the complexity of basic decidable cases of the 
commutation problem for languages: testing the equality XY = YX for 
two languages X, Y, given different types of representations of X, Y. 
We concentrate on (the most interesting) case when Y is an explicitly 
given finite language. This is motivated by a renewed interest and recent 
progress, see (Ed. in an old open problem posed by Conway |2j. We 
show that the complexity of the commutation problem varies from co- 
N EX PTIM E-complete, through P-SPACE complete and co-NP com- 
plete, to deterministic polynomial time. Classical types of description are 
considered: nondeterministic automata with and without cycles, regular 
expressions and grammars. Interestingly, in most cases the complexity 
status does not change if instead of explicitly given finite Y we consider 
general Y of the same type as that of X . For the case of commutation of 
two finite sets we provide polynomial time algorithms whose time com- 
plexity beats that of a naive algorithm. For deterministic automata the 
situation is more complicated since the complexity of concatenation of 
deterministic automaton language X with a finite set Y is asymmetric: 
while the minimal dfa’s for XY would be polynomial in terms of dfa’s 
for X and Y, that for YX can be exponential. 



1 Introduction 

Research on word equations during the past few deacades has revealed several 
amazing results, most notably the Makanin’s algorithm and its extensions [1 
irn] . The situation changes completely when equations on languages (or even on 
finite languages) are considered. Very little, or in fact almost nothing, is known 
about their solutions HH- The exception being the case when such restricted 
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equations with two operations - union and concatenation - are considered where 
certain fixed point results become applicable. An example is the method to 
compute the rational expression for a given finite automaton, see m for a 
general survey. Such a theory, however, does not have a counterpart in word 
equations. 

The equation corresponding to the commutation problem is: XY = YX. 

Even this deceiptively simple equation proposes several natural and combinato- 
rially intersting problems. Recently there was a renewed interest in commutation 
problem due to the partial solution to the problem posed about 30 years ago by 
Conway 

Assume X is the maximal solution to XY = FA for a given finite set Y. 
Is the language A regular ? 

An affirmative answer has been given in for |F| < 3, but the case |F| > 3 
is still unresolved. It seems that further study of commuting with finite sets 
is needed. In this paper we study the following commutation problem for 
languages: 

Instance: Given descriptions of two formal languages A and Y, 

Question: What is the complexity of testing the equality X -Y = Y ■ X 1 

We can change the way we formulate a problem instance by changing the type of 
description: deterministic and nondenterministic finite automata (n/a’s, in short) 
with and without cycles, context-free grammars {cfg's, in short) and regular 
expressions. Even the exact complexity of the case of an explicitely given finite 
A is nontrivial. The set Y is in most cases an explicitely given finite language, 
unless it is specified explicitly otherwise. Then the size of Y is to be understood 
as the total length of its memeber words. Also a secondary size is considered: 
the cardinality of Y. 

We assume that throughout the whole paper A is a set of single symbols. Our 
next proposition is a special case of a theorem in nq. 

Proposition 1. Assume X fl A™ yf 0, e ^ A and AA = AA. Then A"* C A. 

Proof. Let Z = A™ fl A. We show by induction the following statement for each 
k < m: 

V(y S A^) 3(a: G Z) y is a prefix of x. 

This is clearly true for k = 1. Then, since AA C ZA we have that all words of 
size fc-|- 1, as they are prefixes of A A by inductive assumption, should be prefixes 
of words in A. 

Denote: 

= L^UL2uL^..UL" 

As immediate consequence of the proposition we have the following useful facts. 

Lemma 1. 

1. Assume L = A* or L = A* — {u} for some word u. Then LA = AL 
L = A*. 

2. Assume V(1 < k < m) L n A^ 0, e ^ L and L C A-". Then LX = XL 
L = A^". 
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2 Regular Expressions and Nondeterministic Automata 

Regular languages can be represented by regular expressions or Ufa's. Unfor- 
tunately the representations are not polynomially equivalent. For each regular 
expression R there is an polynomial size (with respect to the size of R) nfa that 
accepts the language represented by R and such an nfa can be constructed in 
polynomial time The opposite is not true: there is an nfa accepting a 

regular language L such that the smallest regular expression for L is of size 
exponential wrt. the size of the nfa |S|. Nevertheless, independently of the rep- 
resentation we prove that the commutation problem for a regular language and 
a finite language is P- SPACE complete. As a consequence we have that the 
commutation problem for two regular languages is also P-SPACE complete. 

Theorem 1. Let X be a regular language giuen by a regular expression. Then 
the eommutation problem for X and Y = X is P-SPACE hard. 

Proof. Let M be any deterministic Turing machine working in polynomial space. 
We assume that M stops on every input. Let be the size of maximal space 

used by M during the computation for a word of size n. Clearly, pm{p) < cn^ 
for some constants c and k. Let w be any word over {a, b}. A history of a 
computation of M on the word w is a word history(w) which is of the form 
ffwoffwiff . . . ffwmff where jwil = c\w\^ -\- 1 and Wi is a configuration of M at 
step i of the computation of M on the word w. Then wq = qowB^ where qo is the 
starting state of M and R is a blank symbol of the tape of M and j = cn^ — 1. 
Moreover we assume that after configuration Wm the machine M stops. 

Let S = {o, b, B} U Q where Q is the set of states of M. Now we construct 
a regular expression R of polynomial size on |w| such that R = S* if w is not 
accepted by M and R = S* — {history{w)} if w is accepted by M. Then the 
result is a consequence of Lemma Q 

The expression ii = i?i U i ?2 U i ?3 U i? 4 . R\ describes words which are not of 
the form ffuoffuiff . . . #Ufc# where Ui are of length c\w\’" 1 and they con- 

tain exactly one state of M. R 2 describes words which are not of the form 
S* qxuffu' q' S* where q, q' G Q, x G {a, b}, M being in state q changes the 
state to q' if x is the symbol under the head of M, \\xuffu'\ — cn^ \ = 1 and the 
exact length of xuffu' depends on whether M moves head to the left or right. 
i ?3 describes the words which are not of the form ffuau'ffwa where |u| = |w| 
and the symbol a is not close to a state q in uau' . R 4 describes the words which 
do not contain wq as a prefix or do not contain an accepting configuration as a 
suffix. 

Corollary 1. Let X, Y be regular languages given by nfa’s or regular expres- 
sions. Then the eommutation problem for X and Y is P-SPACE eomplete. 

Proof. By Theorem H it is enough to prove that the problem is in P-SPACE 
when X and Y are given by two nfa's. Since X and Y are regular, the languages 
XY and YX are regular, too. Moreover they representation as two nfa’s can be 
computed in polynomial time. Now it is enough to solve the equivalence problem 
for two nfa’s in P-SPACE. The latter problem is P-SPACE complete, see |n|. 
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As a straightforward consequence of Theorem Q and Corollary Q we have. 

Theorem 2 . Let X be a regular language given by an nfa or a regular expres- 
sion, and let Y be a finite language given explicitely or by acyclic nfa. Then the 
commutation problem for X and Y is P-SPACE complete. 



Theorem 3 . Let X be a finite language given by an acyclic nfa, and letY = S . 
Then the commutation problem for X and Y is co-NP hard. 

Proof. The problem is in co-NP because all words in XY and YX are of poly- 
nomial length so to prove that XY yf TA it is enough to guess a word which 
distinguishes these two languages and check that it belongs to one of the lan- 
guages and does not to the other. 

The proof of co-NP hardness is analogous to the proof of Theorem E We 
want to prove that checking XY ^YX is NP-hard. We take a nondeterministic 
Turing machine M which works in polynomial time. We assume that M stops 
for every input and every nondeterministic choice. For a given word w define a 
set HI STORY {w) being the set of all possible histories of computation of M 
on the word w. All words in HI STORY {w) are of polynomial size wrt to Itcl. 
Choose c and k such that 

VwVrt G HISTORY{w)\u\ < c\w\^ . 

Now from technical reasons we define 

HISTORY'{w) = {uB^ : u G HISTORY{w),j = c\w\'^ - |m|}. 

All words in HISTORY' {w) are of the same length c\w\^ which depends only on 
|ru|. Denote by Accept{w) the subset of HISTORY' (w) containing all accepting 
histories for w. Given a word w denote m = c|w|^. Now we construct a regular 
language R such that R = A™ if HI STORY {w) does not contain an accepting 
history for w and R = A™ — Accept{w) if HI STORY {w) contains an accepting 
history for w. The result is a consequence of Lemma ^ 

The expression R = U i?2 U i?3 U i?4. R\ describes words which are not of 
the form ffuoffuiff . . . ffukffB^ where Ui are of proper length and they contain 
exactly one state of M. i?2 describes words of length m which are not of the 
form E* qxuffu' q' E* where q, q' G Q, x G {a, b}, M being in state q changes 
the state to q' if x is the symbol under the head of M, \\xuffu'\ — cn^| = 1 
and the exact length of xuffu' depends on whether M moves head to the left or 
right. i?3 describes the words of length m which are not of the form ffuau'ffwa 
where |rt| = |w| and the symbol a is not close to a state q in uau' . i?4 describes 
the words of length m which do not contain tc as a prefix or do not contain an 
accepting configuration as the last configuration of the word. 



Corollary 2 . Let X be a finite language given by an acyclic nfa and Y be a 
finite language given explicitely or by an acyclic nfa. Then the commutation 
problem for X and Y is co-N P -complete. 
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Proof. We may assume that Y is given by an acyclic nfa. First we guess w in 
X and V in Y such that wv is not in KX or vw is not in XY. w and v are of 
polynomial size. Checking that a guess is correct can be done in polynomial time 
since the languages XY and YX are regular. 

Observe here that if we replace acyclic n/a’s by a star-free regular expressions 
then the complexities of considered problems are the same. 

3 Context-Free Grammars 

Context-free languages {cfl's, in short) are much more complicated than regular 
sets, in particular the commutation of an infinite cfl and a finite set is even 
undecidable 0. 

Proposition 2. Let X be a cfl given by a cfg and Y be a two element set. Then 
the commutation problem for X and Y is undecidable. 

Obviously the problem is decidable if X is a finite cfl. However a cfg can 
generate doubly exponential number of different words, even if we assume that 
L(G) is finite. However the lengths of words are only singly exponential. 

Lemma 2. Assume a cfg G, of size m, generates a finite language. Then the 
length of each word in L{G) is bounded by a singly exponential function with 
respect to G. 

Denote by co-NEXPTIME the class of problems whose complements are 
solvable in nondeterministic singly exponential time. Observe that the direct de- 
terminization of a problem in this class requires doubly exponential deterministic 
time. 

We use pushdown automata {pda’s, in short) instead of context-free gram- 
mars {cfg’s, in short). A pda with linear number of states can use its stack to 
count exponential numbers. 

Lemma 3. [Exponential counting.] 

1. There is a pda with 0{n) states which accepts the language 

Leg = {a^W ■■ l<f<2"} 

2. Assume ff ^ S, then there is a polynomial time pda accepting the finite 
language 

{x € {E U ff)* : jxj < 2'^", x contains a subword y G such that jyj 2"} 

Proof. Using its stack A can store a counter represented in binary, with the 
least significant digit at the top of the stack. The height of the stack is n, except 
intermediate steps. The stack of such height can store all numbers n the range 
1 < * < 2". We can add and subtract one from the counter by changing the stack 
appropriately. For example to add one, we go down the stack to find the first 
zero. We change this zero to one and replace erased ones by zeros. We remember 
in the state how deeply we entered into the stack and we can restore it. Hence 
the finite control of the pda keeps the position of a binary digit to be changed, 
and the stack stores the whole numbers. 
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Lemma 4. For each pda A with n states, which at each step places on the stack 
a constant-size string, there is a polynomial-size cfg generating L{A) 

Theorem 4. Assume we are given a cfg G generating a finite set. The problem 
of testing LS = SL is co-N EXPT I M E-hard. 

Proof. 

We show that there is a short grammar generating all sequences of exponen- 
tial size which are invalid histories of accepting computation of a Turing ma- 
chine working in nondeterministic exponential time. We can use the ideas from 
Lemma 0 to construct a pda A accepting all strings of size at most except 
valid accepting histories. Assume the accepting history is a string of the form 
ffwiffw 2 #W 3 . . . ffwt, where t = 2" and IwiI = 2”, Wi G E* , and ff ^ E. 

The pda can use the stack for counting. It guesses which of consecutive codes 
Wi, Wi+i of configurations (of not necessarily the same size, but bounded by an 
exponential function) do not correspond to a valid move of the Turing machine. 
Assume Wi and rci+i differ at position j. A puts the number j in binary on 
the stack (still keeping below the representation of i) and records the mismatch 
symbol. Then it skips nondeterministically at most 2"’ symbols of Wi and finds 
the j-th position in Wi+i using the stack. Then it skips at most 2" positions in 
Wi+i. It does not check if the number of skipped positions in Wi and Wi+i is the 
same, but keeps track of the exponential bound of skipped position. 
Nondeterministically A also acceptes the language from the second point of 
Theorem 0 for the constant c = 3. 

Eventually the pda accepts all strings of length at most 2'^", if there is no 
valid history of computation in 2" time of the Turing machine. If there is a valid 
computation then A acceptes some string of length at most 2°", and at the same 
time it does not accept some string of a same length. We can use Lemma0 Due 
to Lemma 0 A can be transformed into an equiavelent cfg of a polynomial size. 

Corollary 3. Let a finite language L be given by a context-free grammar G and 
letY be a finite language given explicitely or by an acyclic nfa or by a context-free 
grammar G' . Then the commutation problem for L andY is co-NEXPTIME- 
complete. 

Proof. It is enough to prove that the problem is in co-NEXPTIME so it is 
enough to prove that the problem LY ^ TL is in NEXPTIME. We may 
assume that Y is given by a cfg. First we guess words w in L and v in Y such 
that wv is not in TL or vw is not in LY. By Lemma 0 the word wv is of length 
at most exponential wrt the sizes of G and G' and YL is context-free so checking 
whether wv is in TL can be done in exponential time. Similarly we can check 
whether vw is in LY. 

4 Deterministic Finite Automata 

For deterministic finite automata the complexity status of the commutation 
problem is open. This is caused by the assymetry of the complexity of concate- 
nating a dfa language by a finite set from the left and from the right. 
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We have partial results, which suggest that the problem would be PSP AC E- 
complete. Let us consider the problem of the complexity of a language of the 
form L' = L{A) ■ Y , where y is a finite language, measured in the number of 
states of a minimal dfa accepting L' . Observe that if we take L{A) = E* and 
Y = 1- then A can have only one state, the automaton for Y has 0{k) states 
but the smallest dfa for L' needs an exponential number of states. This happens 
because the total size of Y is exponential wrt. k. The situation is much different 
when the total size of Y is part of the input size. In this case L' is accepted 
by a dfa of a polynomial size. However for the language Y ■ L{A) the situations 
is dramatically different. Our next theorem can be found in CHI. We present a 
proof which is more suitable for our considerations. 

Theorem 5. 

1. Assume A is a deterministic automaton and Y is a finite language. Then we 
can construct in polynomial time a deterministic automaton accepting L{A) ■ Y , 
which has a polynomial number of states. 

2. There are dfa’s A and finite languages Y, where the size of A and Y together 
is n, such that the number of states of any minimal dfa accepting Y ■ L(A) is 
exponential. 

Proof. 

1. Polynomial-size dfa for L{A) ■ Y 

Let qq be an initial state of A and ACCa be the set of its accepting states. We 
construct the automaton A which for an input string x remembers (in its finite 
control) the pair (q,z), where: 

z is a longest suffix of x which is a prefix of a word in Y ; 
q = i5a(9o: y), where y is a prefix of x such that x = yz. 

The automaton A' acceptes iff g G ACCa and z gY. We omit the details. 

2. Exponential-size dfa for Y ■ L{A) 

Consider the language 

L^{n) = {V^%uffv : u, uG{a, 6}", {AY < i < n) Ui ^ Vi} 

It is easy to see that a dfa for L^(n) needs an exponential number of states, on 
the other hand we can write 

L^{n) = 1 ” • (Li U L2 U L3 . . . U Ln) 

where each Li is accepted by a linear size dfa, Li is ”in charge” of testing in- 
equality on the f-the position. Define the languages: 

L = 1 • Li U f2 • L 2 U 1^ • Lg . . . 1" • 
and Y = 1U12u13...1”UL” 

The language L is a accepted by a linear size dfa (combination of dfa's for 
Li’s). The corresponding automaton simply reads the number of leading ones, 
before the symbol ” $” . Then depending on this number switches to simulate the 
corresponding dfa for Li. Then 

(y-L)n(i”$-r*) = L^{n). 

Hence any dfa for Y ■ L should have exponential number of states. Otherwise 
L^ (n) would have a small dfa, as a product of the dfa for Y ■ L and linear size 
dfa for 1"$ • r*. 
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Theorem 6. 

Let A be a dfa. 

1. IfY is an explicitly given finite language then we can test the inclusion 
Y ■ L(A) C L(A) ■ Y in deterministic polynomial time. 

2. If Q is a finite prefix set that is given explicitely then we can test the equality 
Q ■ L{A) = L{A) ■ Q in deterministic polynomial time. 

Proof. Due to Theorem El we can construct a deterministic automaton A' for 
L{A) ■ Y with polynomial number of states. Let A" be the complement of A' . 
We also construct a nfa C for Y ■ L{A) with polynomial number of states. Then 
it is enough to check L{C) H L{A") = 0. This can be done by combining C 
and A" into a single nfa D with polynomial number of states (as a product of 
automata). Hence we can test L{C) fl L{A") = 0 by checking if L{D) = 0. This 
problem is reducible to a path problem in a graph. 

The second point follows from the fact that Q ■ L{A) is a language accepted by 
a polynomial size dfa. 

Observe that the complexity of all previously considered problems is deter- 
mined by the complexity of the problem EL = LE. For instance if X, Y were 
regular then the commutation problem for X and Y is PSPACE-complete. Thus 
it is of the same complexity as the commutation problem in which X is reg- 
ular and Y = E. Similar situatoin is with other our problems considered in 
previous chapters. By point (2) of Theorem El the problem if EL = LE for a 
regular L given by a dfa can be solved in polynomial time. This suggests that 
the commutation problem for languages given by dfas is in P. 

We have however strong arguments that it is not the case. In the following 
we will try to convince the reader that it is unlikely that equality Y ■ L{A) = 
L{A) ■ Y , for finite Y , and a dfa A, can be tested in polynomial deterministic 
time. 

Theorem 7. Let A and B be dfa’s and Y an explicitly given finite language. 
Testing the inclusion L{B) C Y ■ L{A) is PSPACE-complete. 

Proof. Let M be a deterministic Turing machine M working in space m for the 
input string w of size n. We construct a dfa accepting non-valid computations 
of M for w similarly as we constructed a dfa for the language L^(m). 

We define a sequence of languages Li, L 2 , ... Ln which together ’’test” that 
two consecutive configurations of M are invalid. This can be done in a similar 
way as for the language L^{n). We omit the details. Eventually we construct 
the language L such that 

WSA* C y • L iff there is no valid accepting computation of M. 

The language can be accepted by a dfa with n-\- 2 states. 

5 Finite Languages 

Let X be an explicitely given finite language. Denote by |X| the number of words 
in X and by ||A|| the total size of all words in X. 



On the Complexity of Decidable Cases 201 



Lemma 5. 

1. Let X be a finite set of words over a non-unary alphabet S then |X| < 

ll^ll 

l°g|E| ll^ll ’ 

2. Let X and Y are two finite languages over a finite alphabet S and let |X| = n 

and |y| = TO. Then < to||X|| + n||y||. 

Proof. Fix the number N and consider a set of words X such that 

1. \\X\\<N, 

2. the cardinality of X is largest possible, 

3. among all sets Y having properties Dl and 0 the number ||X|| is smallest 
possible. 

Consider a trie T for X. The trie has |X| distinguished nodes and each path from 
the root of T to distinguished vertex of T is labelled by a word in X. Clearly 
each leaf of T is distinguished. 

Claim. 1. r is a balanced tree. 

2. Any son of a vertex with less than lAI sons in T is a leaf. 

3. Each vertex in T is distinguished. 

Proof, (of the claim) 

dU Suppose that the difference between depths of two leaves of T is at least 
2. Then we remove the deeper one and put it as a son of the other one. In this 
way the cardinality of the obtained set is the same as the cardinality of X and 
its size is strictly smaller than ||A||. A contradiction. 

m Suppose there is a vertex u in T having less than E sons which is not a 
leaf. Then we remove a leaf whose ancestor is v and put it as a son of v. Again 
the set represented by a new trie has the same number of elements as X and is 
of smaller size. A contradiction. 

(0 Since each leaf is distinguished, we may assume that an internal vertex 
u of r is not distinguished. Then we take any path from u to a leaf, remove v 
and move all other vertices of the path one level up. New trie corresponds to a 
set of the same cardinality as X and smaller size. A contradiction. 

By the claim we may assume that T is a full |A|-ary tree of |A| nodes. Such 
a tree has at least |A|/2 leaves and is of height at least log|j;| |A|. Hence, 

^log|^l |X|<||A||. 

This completes the proof of point O- 
Proof of point @ : 



\\XY\\ < E^^x,y€Y\xy\ < Sx(^xEy(zY{\x\ + |y|) < E^(zx{m\x\ + 
< X^ex(.m\x\ + ||h"||) < to||A|| + n||y|| 
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Theorem 8. Let X and Y be explicitly finite languages over a constant size 
alphabet. Then the commutation problem for X and Y can be solved in 0(|^| ■ 
||F|| + |T| • ||X||) time. In particular if the total size of the languages is n then 
the problem can be solved in time. 

Proof. First we construct a trie for the language XY. Then for each word in 
YX we check whether it is in XY. Next we construct a trie for the language 
YX and for each word in XY we check whether it is in YX. 

The estimation is a consequence of Lemma d 

When most of the words in X, Y are long words then a better measure of the 
input size is a pair {k, n), where n is the total length of all words in X UY and 
k is the number of words. Usually k « n. 

Theorem 9. Let X and Y be finite explicitly given languages of total size n and 
cardinality at most k. Then the commutation problem for X and Y can be solved 
in 0{{kf + n) log n) time. 

Proof. Denote W = XUY. We use the concept of the dictionary of basic factors, 
denoted DBF{W), for the set W of words, see The subwords of words in 
W whose lengths are powers of two, are called basic factors. The dictionary 
DBF{W) assigns to each subword w an integer name{w) G [1, . . . , n] such that 
each basic factor w is uniquely identified by the pair {lengthfw) , namefw)) . 

The following fact has been shown in |2j . 

Claim. DBFfW) can be constructed in time O(nlogn). 

For words whose length is not necessarily a power of two define 
codefw) = {length{w),name{w'),name" {w)), 
where w' , w' are respectively the largest prefix and the largest suffix of w whose 
length is a power of two. 

For a set X of words denote: code(X) = {code(a;) : x G X}. 

We replace the sets X and Y hy X' = code{X), Y' = code{Y). for two codes 
x', y' of words x, y denote by x' ®y' = z' , where z' is the code of xy. 

Claim. Given a dictionary DBF{X U Y), for any two x' , y' G code{X U Y) we 
compute x' ® y' in O(logn) time. 

It is easy to see that: 

X' ®Y' = Y' ®X' ^ XY = YX 

Now the algorithm is very simple. We compute X' ® Y' and Y' ® X' and test 
equality of two sets consisting of triples of short integers. We can do it by sorting 
these sets lexicographically and testing equality of sorted lists. 

6 Related Questions 

We end with posing some open problems. What is the complexity of commutation 
problem for X and Y if 
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1. X and Y are regular languages given by dfalsl 

2. X is given by a deterministic dfa and Y is finite? 

3. X is finite and given by a deterministic dfa and Y is finite? 

4- X is given by a dpda and Y is finite? 

5. X is finite and given by a dpda and Y is finite? 

6. X is regular and Y = S where size of S 

is a constant (|y| < k for a given fc)? 
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Abstract. A complete characterization of cones, semi- AFPs and AFPs 
(abstract families of power series) of algebraic power series in terms of 
algebraic systems is given. 



1 Introduction 

Berstel P states on page 267 that “up to now, no characterization of the family 
of context-free grammars generating the languages of a cone is known” . In this 
paper this problem is solved not only for (principal, full) cones but also for 
(principal, full) semi- AFPs and AFPs. 

In several papers, the author has characterized cones of algebraic power series 
(Kuich [titij 1 and abstract families of algebraic power series (Kuich, Karner P) 
in terms of algebraic systems. In Kuich, Salomaa jH|, Theorem 13.24, cones of 
algebraic power series are characterized in terms of pushdown automata. 

In this paper, we give a complete characterization of cones, semi-AFPs and 
AFPs of algebraic power series in terms of algebraic systems. The paper contains 
this and another two sections. In Section 2, we show that each principal full cone 
(resp. principal cone) of algebraic power series has a full cone generator (resp. 
cone generator) that is the first component of the unique solution of an algebraic 
system in a certain normal form, called cone form. This leads to a characteri- 
zation of principal (full) cones by the first components of the least solutions of 
certain algebraic systems given in matrix form. An easy extension of this result 
yields a characterization of (full) cones. In Section 3, these characterizations are 
used to characterize (principal, full) semi-AFPs and AFPs. 

The reader is assumed to have a basic knowledge of formal power series. 
All notions and notations used in this paper are from Kuich, Salomaa 0 or 
Kuich 0. 

2 Cones of Algebraic Power Series 

In this paper, denotes a fixed infinite alphabet. All finite alphabets E, possi- 
bly provided with indices, are subalphabets of E^o- In the following constructions 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 204- 17171 2001. 
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we sometimes assume without loss of generality that Saa contains “special sym- 
bols” (t, $ or not belonging to any of the finite alphabets S discussed in the 
same construction. 

A multiplicative morphism fi : S* ^ is called rational 

representation. It is called regulated iff there exists a A: > 1 such that, for all 
w € E* with |w| > k, = 0. A rational transducer {with input alphabet 

E and output alphabet E') T = {Q, pc, S, P) is given by 

(i) a finite set Q of states, 

(ii) a rational representation pi : E* ^ 

(iii) S G called initial state vector, 

(iv) P G called final state vector. 

The mapping ||T|| : A{{E*)) — >• A{{E'*)) realized by a rational transducer 1 = 
{Q, pL, S, P) is defined by 

ll'^ll(^)= XI X ’^91 92^92 > TGA{{E*)). 

The rational transducer T is called regulated iff ^ is a regulated rational repre- 
sentation. A mapping r : A{{E*)) — >■ A{{E'*)) is called a rational transduction 
(resp. regulated rational transduction) iff there exists a rational transducer (resp. 
regulated rational transducer) T such that r{r) = ||T||(r) for all r G A{{E*)). 
Our rational transductions are a straightforward generalization of the rational 
transductions of Berstel (Q, Proposition III. 7. 3 to power series. 

Here and in the sequel, A always denotes a commutative semiring. If we deal 
with rational representations or rational transducers that are not regulated, A 
will be additionally continuous. This makes sure that all infinite sums that occur 
in connection with representations and rational transducers are well-defined. 

Our basic semiring will now be A{{E^)). The subsemiring of A{{E^)) con- 
taining all power series whose supports are contained in some E* is denoted by 
A{{A^}}, i. e., 

A{{17^}} = {r G A{{E^)) I there exists a finite alphabet E C E^o 

such that supp(r) C A*}. 

For E C Aoo, A{{E*)) is isomorphic to a subsemiring of A{{A^}}. Hence, we 
may assume that A{{E*)) C A{{A^}}. Furthermore, we define two subsemirings 
of A{{A^}}, namely, the semiring of algebraic power series by 

^aig{{^^}} _ g A{{A^}} I there exists a finite alphabet E C Eoo 

such that r G 

and the semiring of rational power series by 

^rat{{j^^}} _ jj. g A{{A^}} I there exists a finite alphabet E C E^o 

such that r G A"-=^*((A*))}. 
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Let T = {Q, fj,^ P) be a rational transducer with input alphabet S. The 
extended mapping ||T|| : ^{{27^}} — >• yl{{17^}} is then defined by 

Qi^Q2^Q wGI!* 

Here the rational representation /i is extended to fj, : — >• 

by n{w) = 0 if m ^ E*. Hence T translates only words w of E* and maps 
words not in E* to 0. Again, a mapping r : — >• ^{{27^}} is called a 

(regulated) rational transduetion iff there exists a (regulated) rational transducer 
T such that r(r) = ||T||(r) for all r € A{{!7^}}. 

Each nonempty subset £ of A{{27^}} is called family of power series. In the 
sequel, £ will denote a family of power series. A family of power series closed 
under rational transductions (resp. regulated rational transductions) is called a 
full cone (resp. cone). Define 

S0I(£) = {r(r) | r G £ and r is a rational transduction} 



and 



9Il(£) = {r(r) | r G £ and r is a regulated rational transduction} . 

Since rational transductions and regulated rational transductions are closed un- 
der functional composition (Kuich 0, Theorem 7.9), £ is a full cone iff £ = ®i(£) 
and £ is a cone iff £ = 9Jl(£). A cone £ is called principal full (resp. princi- 
pal) iff there exists a power series r G A{{i7^}} such that £ = 9H(r) (resp. 
£ = 97l(r)). The power series r is then a full cone generator of the full cone £ 
(resp. cone generator of the cone £). (See Berstel P; and Ginsburg |2], where 
cones are called trios.) Again, if we deal with full cones our semiring A will be 
commutative and continuous. 

We now want to characterize full cones and cones of algebraic power series 
by algebraic systems in cone form. Here an algebraic system is in cone form iff 
it is of the form 

y% — ^ ^ ^iTT^iTr"^ : 1 ^ f ^ ^ ? 

Tvez 

where G A, = 1, XiT^ G A, tt G ^ U U K U {e}, \ < i < n,Y = 
{Vi\^<'i< n}, and {xi^ \ n € Z - {j/J, 1 < z < n} n {xiy^ | 1 < z < n} = 0. 

In a series of theorems and lemmas we will prove that with respect to the 
generation of principal cones the cone form is a normal form. 

Theorem 1. A mapping r : A({E*)) — ^ A{{E'*)) is a rational transduction (resp. 
regulated rational transduction) iff there exists an ro G A''‘‘‘((27'*)), a rational 
representation (resp. regulated rational representation) p, : E* — >■ (AlfE'*)))^'^^ 
and i,t & Q, i ^ i, such that, for all r G A((I7*)), 

r(r) = (r,e)ro + ^ {r,w)p{w)^^t ■ 
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Proof. Use the construction in the proof of Theorem 9.4 of Kuich, Salomaa |H|. 
It works for both, regulated rational transductions (with commutative semiring 
A) and rational transductions (with commutative continuous semiring A). The 
proof of Theorem 9.4 of Kuich, Salomaa |E| is valid in both cases. g 

We often will use the following corollary of Theorem 1 without further men- 
tion. 

Corollary 21 Let t : — >■ he a rational transduction (resp. 

regulated rational transduction) . Then there exists a rational representation p : 
— >■ and i,t & Q, i t, such that, for each r G A{{U^}} 

with (r, e) = 0, 

T{r)= {r,w)p{w)^^t = p{r)i,t- 

w^U* 

Lemma 22 Each principal full cone (resp. principal cone) has a quasiregular 
full cone generator (resp. cone generator). 

Proof. By Theorem 11.31 of Kuich, Salomaa |S|. □ 

Lemma 23 Let h : S* — ^ (U U <t)* be the morphism defined by h{x) = x<^. for 
all X G S. Then, for r G A{{S*)), ®l(r) = dJl{h{r)) and Tl{r) = 9Jl(/i(r)). 

Proof. By Lemma 13.18 of Kuich, Salomaa 0- □ 

Lemma 24 Let h : E* — >■ (ULIct)* be the morphism defined by h{x) = a;(|: for all 
X G S* . Then, for all r G A{{S*)) with (r,e) = 0, there exists a strict algebraic 
system yt = Qt, I < i < n + I, where supp(gi) C E{yn+i} U E{yn+i}Y U 
E{yn+i}Y'^ , 1 < i < n, and Qn+i = such that h{r) is the yi-component of its 
unique solution. 

Proof. By Theorem 14.33 of Kuich, Salomaa 0) is the first component of the 
unique solution of an algebraic system yi = pi, 1 < z < n, in Greibach normal 
form with supp(pi) C U U EY U EY"^: 

J2i<j,k<nJ2xesiPi’^yk)xyjyk, 1 < z < n. 

Assume that a is the unique solution oi yi = pi, 1 < i < n. We claim that h{a) 
is the unique solution of the algebraic system 

yi ~ '^xes (k, x)h{x) + J2xes(Pi^ xyj)h{x)yj + 

Y.i<j,k<uY.x^AP^^xyjyk)h{x)yjyk, l<i<n. 

The claim is shown by substituting h{a) into this algebraic system: 

J2xesiP^^x)h{x) +J2i<j<nJ2xes(Pi^^y3)Hx)Haj)+ 

Y.i<j,k<n Y.xes(Pi> xyjyk)h{x)h{aj)h{ak) = 
h{pi{ai,...,an)) = h{ai), l<i<n. 
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By Lemma 14.25 of Kuich, Salomaa 0, (h{r),h{a 2 ), ■ ■ • , h{an),t) is the unique 
solution of the algebraic system with variables in {yi , . . . , j/„, 

Vi = T.xesiPi^x)xyn+l+J2l<j<nT.xeEiPi^Xyj)^yn+iyj + 

Y.l<j,k<nJ2a:eEiP^^^yjyk)xyn+iyjyk, l<l<n, 
yn+1 ~ ^ ■ 



Lemma 25 Let h : S* — >■ (I7U(t)* be the morphism defined by h{x) = a;(t for all 
X € E* . Then, for all r € A{{E*)) with (r,e) = 0, there exists a strict algebraic 
system 

y% ^ ^ •! 1 ^ ^ ^ ^ 5 

irez 

where OiT^ G A, aiy. = 0, € E , n € Z , Z = {e} U F U U 1 < * < n, 

such that h(r) is the first component of its unique solution. 

Proof. By Lemma 5, h{r) is the first component of the unique solution of a strict 
algebraic system yi = Pi, 1 < i < t, where supp(pi) C <s^\JE{Y — yi)[JEY'^yjEY^: 

Vi = 

J2xesiP^^ xyjyk)xyjyk+ 

Y.i<j,k,m<tY.x^si.P^^xyjykym)xyjykym, l<i<t. 

Consider now the strict algebraic system with variables in {yf \ x & E, 1 < i < 

t} 



{Pi,xyjyk)xy]yf.+ 

Y.i<j,k,m<tT.x^si.P^^xy^ykym)xy]ylyf^, l<i<t, xGE. 

We claim that the yf -component, a; G 27, 1 < i < t, of the unique solution is 
given by ai. This claim is easily shown by substituting this unique solution into 
the equations of the above algebraic system. 

Since (j>i, xyi) = 0 for a; G 27, 1 < i < t, our lemma is proven. g 

Lemma 26 Let r : 2l((27*)) — >■ >1((27U$)*)) be the substitution defined by t(x) = 
$*a:, a: G 27. Then, for all r G A{{E*)), ®l(r) = OT(r(r)) and 37l(r) = 97l(r(r)). 

Proof. Clearly, for r G A{{E*)), r{r) G 37l(r). Hence, 9H(r(r)) C Tl{r) and 
s5l(T(r)) C dk{r). Conversely, define the substitution r' : H(((27U$)*)) — )> A{{E*)) 
by t'{x) = X, X G E, r'($) = 0. Then r'(r(r)) = r for all r G A{{E*)). Hence, 
OJl(r) C 3Jt(r(r)) and OT(r) C f5l(r(r)). g 

Lemma 27 Let a be the unique solution of an algebraic system yi = pi, 1 < i < 
n, where supp(pi) C EY* . Let t : A{{E*)) — >■ H(((27 U $)*)) be the substitution 
defined by t{x) = $*a;, a: G 27. Assume that a' is the unique solution of the 
algebraic system yi = Pi + %yi, 1 <i <n. Then a' = t((t). 
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Proof, (i) We prove that r(cr) is the unique solution of the system yi = $*pi, 
1 < * < n: 

$*Pj(r(cri), . . . ,t(ct„)) = 

ExGi:EaGy*(P*’a:a)$*a;a(r(cri), . . . ,r(cr„)) = 

Exgi: EaGV (Pi^ xa)T{xa{ai, cr„)) = 
r(pi(cri, . . . ,CT„)) = T(cTi), l<i<n. 

(ii) Consider the linear system 

yi = hi+Pi{fx[,...,(j'J, l<i<n. 

By Theorem 4.17 of Kuich, Salomaa 0, its unique solution is given by 
{Vpi{a[, a'J, $*pn{cr [, . . . , a'J) . 

But {a[,. . . , a!^) is also a solution. Hence, ct- = $*pi{a[, . . . , a'„), 1 < i < n. 

(iii) Since the system yi = $*pi, 1 < z < n, has a unique solution, we infer 

that a' = r(cr). g 

Theorem 2. Each principal full cone (resp. principal cone) in has 

a full cone generator (resp. cone generator) that is the first component of the 
unique solution of an algebraic system in cone form. 

Proof. By Lemmas 3-6, each principal full cone (resp. principal cone) has a full 
cone generator (resp. cone generator) that is the first component of the unique 
solution of a strict algebraic system 

Vi ^ ^ ^i'TrXiTr'^ j 1 ^ Z ^ n , 
nez 

where aijr € A, aiy. = 0, XiT^ £ 17, tt G Z, Z = {e} U F U U F^, 1 < z < n. 
By Lemma 8 we can add to the z-th equation of this algebraic system the term 

1 < z < rz, and get a first component of the unique solution of the new 

system that is again a full cone generator (resp. cone generator) of the considered 
principal full cone (resp. principal cone). 

Our theorem is proved by the observation that the new algebraic system is 
in cone form. g 

Corollary 28 Let £ C A‘^*s{{17^}}. Then £ is a principal full cone (resp. prin- 
cipal cone) iff it has a full eone generator (resp. eone generator) that is the first 
component of the unique solution of an algebraic system in cone form. 

Let © be an algebraic system y^ = pi, 1 < i < n, in cone form, i. e., 

p, = {Pi,Xi)Xi-i-J2l<j<niPi^^ijy])^Ly3 + 

X/i<j,fc<n(P£ Xijkyjyk)xijkyjyk~\~ (*) 

^2l<j,k,m<niP'i''’ ^'i'jk'myjykym)Xijkmyjykymi 1 ^ ^ ZZ , 

where Xi,Xij,x^k,Xijkm G {p%,Xuyf} = 1, 1 < i,j,k,m < n, and {{xi \ 1 < 
z < n} U {xij I 1 < z,j < n, j ^ i} U {xijk \ 1 < ij,k < rz} U {xijkm \ 
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1 < i,j, k,m < n}) fl {xu | 1 < i < n} = 0. Then, by definition, S&1(6) is the 
collection of all the following algebraic systems written in matrix form: 

^^Kj^k^m<niP^^ ^ p{^ijk7n)^j^k^7n: 1 ^ i ^ 77 . . 

Here = (y*fc)i<yfe<tj l<i<n, isatx t-matrix of variables and /r is 
a rational representation defined by /i : 27 — >■ {A{S' U e))*^‘, t > 1. More- 
over, 0 denotes the Kronecker product (see Kuich, Salomaa [B|)- Furthermore, 
lDt(6) is the collection of all algebraic systems defined above where the matrices 
^i{xi), ^{xij), fi{xijk), p{xijkm) satisfy the following additional condition: 

There exists an s > 1 such that each product of s of these matrices is 

quasiregular. (Hence, if the matrix M is such a product then (M, e) = 0.) 

The y]^(-component of the least solution of the algebraic system defined above 
is called the principal component of this least solution. We are now ready to 
characterize principal full cones (resp. principal cones) in H®''®{{27^}} by the 
principal components of the least solutions of the algebraic systems in 371(6) 
(resp. 371(6)). 

Theorem 3. Let r be the first component of the unique solution of an algebraic 
system 6 in eone form. Then the principal full cone 37t(r) (resp. principal cone 
37l(r ) ) coincides with the collection of all principal components of least solutions 
of the algebraic systems in 371(6) (resp. 37l(6)J. 

Proof. Let 6 be given by the equations (*). Define the algebraic principal cone 
type (see Kuich, Salomaa 0, page 281) ({?/i, 27, T, j/i) by 

Fy,7r^7T — {Pi^Xi'jXi.f 

“ {PiT ^ijyj)^ij 1 
'^yi'’^.vjVkTr — {PiT Xijkyjyk)xijkj 

.yjykVm'^ — i.Pi-!^ijkmyjykym)^ijkm-} 

1 < fj,k,m < n, 7T G {j/i, Then, by Theorem 10.7 of Kuich, Salo- 
maa 0, the unique solution of 6 is given by {{T*)y.^i;)i<i<n- Hence, r = (T*),^^^^. 

Consider now the algebraic system (**), where /i is a regulated rational rep- 
resentation. Since ((T*)y.^e)i<i<„ is the unique solution of (*), substitution of 
p{T*)y.^s for Yi, 1 < i < n shows that {y{T*)y.^e)i<i<n is a solution of (**). By 
Theorem 10.7 of Kuich, Salomaa 0, it is the unique solution of (**). 

We first prove the case of the principal cone. Assume that s G 371(6). Then, 
by Corollary 2, there exists a regulated rational representation /r' : 27* — 
(^rat(( 27 /*)))t xt ^ f > 2, such that s = pt'{r)i^t'- Hence, s = y,' {(T*)y^^g)i^t' ■ 
Observe that T is a loop type (see Kuich, Salomaa 0 , page 226). Hence, by 
Theorem 11.46 of Kuich, Salomaa 0 , there exists a regulated rational represen- 
tation y defined by /x : 27 — >• (A(27' U e))*^*, t > t', such that ((/x'(T)*)yj^e)i_(/ = 
((^(T)*)j^j_e)i_(/. Since t' > 2, renaming yields s = {{y{T)*)y^^e)i,t and s is the 
principal solution of an algebraic system in 371(6). 
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We now show the converse. Let s be the principal component of the unique 
solution of (**). Then s = i > 1, and s S SH(r). 

We now consider the case of the principal full cone. Given a rational 
representation : E* — >■ t' > 1, let /r' be the regu- 

lated rational representation fj,' : E* ^ (A''®'* (((A'' U { 2 })*)))* defined by 
H'{x) = {^i"{x),e)z + J2ujes'+ zc)'ic, X C E^ where z C E^ — E^ . Let 

h : {E' U {z})* — >■ E'* be the morphism defined by h{x) = x, x G E' , and 
h{z) = e. Then, for each r € A{{E*'f), we obtain ^"{r) = h{fj,'{r)). 

Assume now that s G 9Jt(6). Then there exists a regulated rational rep- 
resentation : E* ^ (A’^^*(((E' U {z})*)))*'^*' such that s = = 

As above there exists now a regulated rational representation 
fi defined by ^ : A — t> (A((E' U {z}) U e)Y^* such that s = h{{^{T)*)y^^Y}i,t- 
By Theorems 7.10 and 7.4 of Kuich is the least 

solution of (++). Hence, s is the principal solution of an algebraic system in 
371(6). Conversely, let s be the principal component of the least solution of (**). 
Then by Theorems 7.10 and 7.4 of Kuich 0, s = {^{T*)y^^Y)i,t = Hence, 

seOT(r). □ 

Corollary 29 £ C jg ^ principal full cone (resp. principal cone) 

iff £ coincides with the collection of all principal components of least solutions 
of the algebraic systems in 371(6) (resp. 371(6) j for some algebraic system 6 in 
cone form. 



For A = B this is a characterization of principal full cones and principal cones 
by context-free grammars. 

Example 1. The restricted one counter languages form a principal full cone that 
is generated by the Lukasiewicz language (see Greibach 0, Berstel P). These 
restricted one counter languages are generalized by Kuich, Salomaa P to power 
series. Again these restricted one counter power series form a principal full cone 
and again it is generated by the solution of the algebraic system y = x\yy + X 2 . 
Clearly, this principal full cone is also generated by the solution of the algebraic 
system y = x\yy + xy + X 2 in cone form. Hence, a power series is a restricted 
one counter power series iff it is the i/it-component of the least solution of an 
algebraic system Y = MiYY + MY + M2. Here K is a t x t-matrix of variables 
Vjk, ^ < j,k <t and M, Mi, M 2 G {A{E U e))*^*, t > 1. □ 

We now characterize full cones and cones of algebraic power series by alge- 
braic systems in cone form. 

Theorem 4. £ C A’^*s||y 7 ^|| ig a full cone (resp. cone) iff there exist algebraic 
systems &i, i G I, in cone form such that £ coincides with the collection of 
all principal components of least solutions of systems in 371(6^) (resp. dJl{&i)), 
i G I. 

Proof. We first consider the case of the full cone. Let £ = ®T(£'). Then £ = 
UrG£' ®l(r). Hence, our theorem is proved by Corollary 12. 

The proof for the case of the cone is analogous. □ 
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3 Semi-AFPs and AFPs 

Let £ be a family of power series. Then S(£) (resp. S(£)) is the smallest sub- 
monoid of -I-, 0) that is closed under rational transductions (resp. reg- 

ulated rational transductions) and contains £. The notation ^(£) (resp. !?(£)) is 
used for the smallest rationally closed subsemiring of A{{27^}} that is closed un- 
der rational transductions (resp. regulated rational transductions) and contains 
£. 

A family £ of power series is called a full semi-AFP or a full AFP (resp. 
semi-AFP or AFP) iff £ = ©(£) or £ = ^(£) (resp. £ = S(£) or £ = 5'(£)). 
Here AFP stands for “abstract family of power series”. 

If there exists an £' such that £ = OT(£'), £ = S(£') or £ = §'(£') (resp. 
£ = 9H(£'), £ = ©(£') or £ = !?(£')) then £' is called full eone, full semi- 
AFP or full AFP generating family (resp. cone, semi-AFP or AFP generating 
family). If £ is a (full) cone, a (full) semi-AFP or a (full) AFP then £ is always 
a generating family. 

If £' contains only one power series r G A{{ 17^}} then £ is called principal full 
semi-AFP or principal full AFP (resp. principal semi-AFP or principal AFP). 
In this case r is termed full semi-AFP or full AFP generator (resp. semi-AFP or 
AFP generator) . In this case we write £ = &{r) or £ = ^(r) (resp. £ = ©(r) or 
£ = !?(?'))• For A = B, our operators 6, © and ^ coincide with the operators 

S, T, S and T of Ginsburg [2|, respectively. 

We first consider full semi- AFPs and semi- AFPs. The next lemma is proved 
analoglously to Theorems 11.31 and 11.35 of Kuich, Salomaa |BI by help of 
Lemma 7. 

Lemma 31 If Z' is a full semi-AFP (resp. semi-AFP) generating family of the 
full semi-AFP (resp. semi-AFP) £, then 

Z” = { I Tfc G £', 1 < fc < m, m > 1} 

is a full cone (resp. cone) generating family of Z. 

We now will characterize full semi- AFPs and semi-AFPs of algebraic power series 
by algebraic systems of a certain form. 

Let © j , j G J, be algebraic systems in the cone form with equations y( = p(, 
1 < i < Uj. An algebraic system is called semi-AFP system iff its equations are 

yo = (tyf H h -h $ 2 / 0 , yi'‘ =pI\ 1 < ■* < rij^ , 

for some G J, m > 1, and is denoted by [©jj , . . . , ©j„,]. Let rj 

be the first component of the unique solution of 6j, j G J. Then the yo~ 
component of the unique solution of the semi-AFP system [©j^ , . . . , ©j^] is 
given by J2i<k<m A, • ■ ■ , jm G J, m > 1. Observe that a semi-AFP sys- 

tem [©jj , . . . , ©j„] is in the cone form. Hence, the collections ®i([©jj , . . . , &j„,]) 
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(resp. , &j^])) are defined. The equation yo = H +$2/o 

gives rise to a matrix equation 

Yo = H b , 

where Yq = {y^jk)i<j,k<t is a t x t-matrix of variables and /i is a rational 
(resp. regulated rational) representation defined by ^ : A" — )> {A{S' U e))*^b 
By definition, the j/Jj-component of the least solution of an algebraic system in 
97l([©jj, . . . , &j^]) (resp. fOl([©j^, . . . , ©j„])) is called the principal component 
of this least solution. 

Theorem 5. Let 6j, j G J, be algebraic systems in the cone form. Letrj, j G J, 
be the first component of the unique solution of 6j . Then the full semi- AFP 
I j £ (fssp. semi-AFP 6{{rj \ j G J})) coincides with the collection 
of all principal components of the least solutions of the algebraic systems in 
(resp. for allji,...,jm G J, m > 1. 

Proof. We only proof the case of the semi-AFP. The proof for full semi- AFPs is 
analogous. 

Lemma 14 implies 

I j e J}) = 0Jl({ ^ I ji, ...,jrn&J, rn > 1}) . 

l<fc<m 

Since X)i</c<m j/o-component of the unique solution of the semi- 

AFP system'~[©jj , . . . , &j^], ji,. . . ,jm € J, m> 1, the collection of all principal 
components of the least solutions of the algebraic systems in 37t([©jj, . . . , &j„]) 
coincides, by Theorem 11, with I Ji; ■■■Jm ^ J, m > 1}). 

The equality 

G J, m> 1}) = IJ fm({ Y 

l</c<m m>l l<fc<m 

proves our theorem. g 

Corollary 32 £ C is a full semi-AFP (resp. semi-AFP) iff 2, coin- 

cides with the collection of all principal components of the least solutions of the 
algebraic systems in 2H([©jj , . . . , &jm\) (resp. 9H([©jj , . . . , &jm\) )i jii ■ ■ ■ j jm C 
J, m > 1, for some algebraic systems &j, j G J , in the cone form. 

Since every principal full cone (resp. principal cone) is a full semi-AFP (resp. 
semi-AFP), the characterization of principal (full) cones in Section 2 is valid 
also for principal (full) semi-AFPs (see Kuich, Salomaa [B|, Theorem 11.26, and 
Berstel P3> Example V.4.2). 

We now consider full AFPs and AFPs. The next lemma is proved analoglously 
to Theorem 11.31 and Corollary 11.41 of Kuich, Salomaa |BI by help of Lemma 7. 
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Lemma 33 If £' is a full AFP (resp. AFP) generating family of the full AFP 
(resp. AFP) Z, then 

£" = {( ^ $*(trfc)*$*# I rfc € £', 1 < A < m, m > 1} 

l<k<m 

is a full cone (resp. cone) generating family of Z. 

We now will characterize full AFPs and AFPs of algebraic power series by alge- 
braic systems of a certain form. 

Let 6j , j € J, be algebraic systems in the cone form with equations y( = pj, 
1 < i < Uj. An algebraic system is called AFP system iff its equations are 

yo = 2/0 H h + $yo + #, yl'° l < * < , 

for some ji, . . . , jm G J, m > 1. This AFP system is denoted by |Sjj, . . . , SjVn]. 
Let rj be the first component of the unique solution of ©j, j G J. Then the 
2 /o-component of the unique solution of the AFP system |6ji, . . . , &j^\ is given 
by (Si<fc<m ji, - ■ ■ , 3 m & J, rn>l. Observe that an AFP system 

|©jj, . . . , ©j^] is in the cone form. Hence, the collections OT(|©jj , . . . , ©j^]) 

(resp. 2H(|©jy , . . . , ©j„,])) are defined. The equation yo = (tyf yoH hdy^yo + 

$yo -I- # gives rise to a matrix equation 

Po = p{i)Yl^Yo + • • • + + y($)>b + y(#) , 

where Yq = {y^i^)i<j^k<t is a t x t-matrix of variables and /r is a rational 
(resp. regulated rational) representation defined by ^ : A — )> {A{S' U e))*^L 
By definition, the yj^-component of the least solution of an algebraic system in 
9Jl(|©jj, . . . , ©j,„]) (resp. 9}l(|©jj, . . . , ©j„,])) is called the principal component 
of this least solution. 

Theorem 6. Let 6j, j G J, be algebraic systems in the cone form. Let rj, 
j G J , be the first component of the unique solution of &j. Then the full AFP 
i?({0 I 3 S (i"ssp. AFP 5'({rj | j G J})) coincides with the collection 
of all principal components of the least solutions of the algebraic systems in 
9Jl(|©ji,...,©jv„l) (resp. Tl{l6j^,...,ej^l))forallji,...,jm GJ,m>l. 

Proof. Similar to the proof of Theorem 15; but use now Lemma 17. g 

Corollary 34 £ C jg ^ f^n AFP (resp. AFP) iff Z coincides with 

the collection of all principal components of the least solutions of the algebraic 
systems in OT(|©jj , . . . , ©j^]) (resp. 97t(|©j^, . . . , ©j-^])j, ji,...,jm G J, m> 
1, for some algebraic systems &j, j G J , in the cone form. 

The last case to be considered is the case of principal full AFPs and princi- 
pal AFPs. The next lemma is proved analogously to Theorem 11.42 of Kuich, 
Salomaa [B| by help of Lemma 7. 
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Lemma 35 Ifr G A{{i7^}} is a full AFP (resp. AFP) generator of the principal 
full AFP (resp. principal AFP) £ then ($*(|;r)*$*# is a full cone (resp. cone) 
generator of Z. 

We will now characterize principal (full) AFPs of algebraic power series by 
algebraic systems of a certain form. Let © be an algebraic system in the cone 
form with equations j/i = Pi, 1 < i < n. An algebraic system is called principal 
AFP-system (with basis 6) iff its equations are of the form 

2/0 = <t2/i2/o + %o + #, Vi=Pi, l<i<n. 

This principal AFP-system is denoted by |©]. Observe that a principal AFP- 
system is a special case of an AFP-system. Hence, the next two results are 
corollaries of Theorem 18. 

Corollary 36 Let 6 be an algebraic system in the cone form. Let r be the first 
component of the unique solution of&. Then the principal full AFP §(r) (resp. 
principal AFP U(r ) ) coincides with the collection of all principal components of 
the least solutions of the algebraic systems in 91l(|©]) (resp. 9Jl(|©])j. 

Corollary 37 £ C is a principal full AFP (resp. principal AFP) iff 

£ coincides with the collection of all principal components of the least solutions 
of the algebraic systems in 2H(|©]) (resp. 9H(|©])^ for some algebraic system © 
in cone form. 

Example 2. The one counter languages form a principal full AFL that is gen- 
erated by the Lukasiewicz language (see Greibach 0, Berstel P). These one 
counter languages are generalized by Kuich, Salomaa jS| to power series. These 
one counter power series form a principal full AFP that is again generated by 
the solution of the algebraic system y = Xiyy + X 2 . Clearly, this principal full 
AFP is also generated by the solution of the algebraic system y = xiyy + xy+X 2 . 
Hence, a power series is a one counter power series iff it is the 2;it-component of 
the least solution of an algebraic system 

£ = M3YZ + MiZ + Mg , 

Y = MiYY + MY + M2 . 

Here Z, Y are t x f-matrices of variables Zjk, yjk, ^ Y j,k < t, respectively, and 
M, Ml, M2, M3, M4, Mg G {A{E U t>l. □ 
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Abstract. We consider a new kind of machines with a circular tape and 
moving in one direction only, so-called Circular Post machines. Using 2- 
tag systems we construct some new small universal machines of this kind. 



1 Introduction 

In 1956 Shannon introduced the problem of constructing very small universal 
(deterministic) Turing machines. The underlying model of Turing machines is 
defined by instructions in form of quintuples (p, x, y, m, q) with the meaning 
that the machine is in state p, reads symbol x G S, overwrites it by y, moves by 
m G {—1,0, 1}, and goes into state q. Another equivalent model is defined by 
quadruples (p, x, a, q) where a G YU{ — 1, 0, 1}. This model is also equivalent to 
so called Post machines m- Whereas the quintuple model allows to construct 
equivalent machines with 2 states this is impossible for the quadruple model 

Let UTM(to, n) be a class of universal Turing machine with m states and n 
symbols. It was known that there exist universal Turing machines in the follow- 
ing classes: UTM(22,2), U TM(1 0,3), UTM(7,4), UTM(5,5), UTM(4,6), 
UTM(3,10), UTM(2,18) 11711^ and the classes UTM(2,2) UTM(2,3), 

and UTM(3,2) [T^ are empty. Recently it was shown that there exist universal 
Turing machines in the classes UTM(19,2) 0 and UTM(3,9) 0, improv- 
ing the previous results UTM(22,2) and UTM(3,10)- The last results reduce 
the number of classes UTM(m,n) with an unsettled emptiness problem (i.e if 
UTM(m,n) is empty) from 49 to 45. 

We introduce (deterministic) Circular Post machines (CPM). These are sim- 
ilar to those presented in [2| , with the difference that the head can move only in 
one direction on the circular tape. It is also possible to erase a cell or to insert 
a new one. We consider 5 variants of such machines, distinguished by the way 
a new cell is inserted. In |0I it has been shown that all variants are equivalent 
to each other, and also to Turing machines. We also show that for all variants 
there exist equivalent Circular Post machines with 2 symbols, as well as with 2 
states. 

To construct small universal Circular Post machines we use a method first 
presented in ^ (see also jlbllYj l. This method uses tag systems 0 which 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 217- 17^ 2001. 

@ Springer- Verlag Berlin Heidelberg 2001 
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are special cases of monogenic Post Normal systems namely of the form 
SiVU — > uai with v G and k > 1 a constant. In m it is also shown that 

2-tag systems (i.e. k = 2) suffice to simulate all Turing machines, with halting 
only when encountering the special symbol sh- 

Since Circular Post machines are also monogenic Post Normal systems we 
expect to get a more natural simulation of tag systems and perhaps smaller 
universal machines. 

Circular Post machines may be useful for studying the formal models of 
biocomputing where the DNA molecules are present in the form of a circular 
sequence PI- 

In a previous article jO] it was shown that such machines can simulate all 
Turing machines, and some small universal circular Post machines have been 
constructed, namely in the classes UPM0(7, 7), UPM0(8, 6), UPM0(11, 5), 
and UPM0(13, 4). In this article we present some new results on circular Post 
machines, and more universal machines, namely UPM0(4, 18), UPM0(5, 11), 
UPM0(6, 8), UPM0(9,5) (improvement of UPM0(11, 5)), UPM0(12, 4) 
(improvement of UPM0(13, 4)), and UPM0(18, 3). Here UPMi(m,n) de- 
notes the class of universal circular Post machines of variant i with m states and 
n symbols. 



2 Definitions 

Here we introduce some variants of circular Post machines. 



Definition 1. A Circular Post machine is a quintuple (A, Q, qo, q/, P) with a 
finite alphabet S where Q is a the blank, a finite set of states Q, an initial state 
qo G Q, a terminal state q/ G Q, and a finite set of instruetions of the forms 
px — >■ q (erasing of the symbol read) 
px — >■ j/q (overwriting and moving to the right) 
pO — ^ 2 /qO (overwriting and creation of a blank) 

The storage of such a machine is a circular tape, the read and write head 
moving only in one direction (to the right), and with the possibility to cut off a 
cell or to ereate and insert a new cell with a blank. 

This version is called variant 0 (CPMO). Note that by erasing symbols the 
circular tape might become empty. This can be interpreted that the machine, 
still in some state, stops. However, in the universal machines constructed later, 
this case will not occur. 

In this article it will assumed that all machines are deterministic. 

There are variants equivalent to such machines p]. 

Variant CPMl : The instructions are of the form 
pa; — >■ q pa; — >■ yq pa; — a;qO (0 blank) 
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Variant CPM2 : The instructions are of the form 
px — >■ q px — yq px — ?> ?/q0 (0 blank) 

Variant CPM3 : The instructions are of the form 
px — )■ q px — )■ ?/q px — >■ yzq. 

Variant CPM4 : The instructions are of the form 
px — >■ q px — >■ yq px — ?> yxq. 

□ 



3 Basic Results 

Here we present some new basic results on circular Post machines. The first 
results show that there are no universal circular Post machines with either only 
one state or one symbol, respectively. 



Theorem 1. CPM’s with one symbol 0 simulate finite automata only. 

Proof. Since there is only one symbol the instructions are of the form pO — >■ q, 
pO — qO, or pO — >■ qOO. These are essentially productions of Right Regular 
(Biichi) systems and therefore generating only regular sets. 

□ 



Theorem 2. CPM’s with one state have decidable word, emptiness, and finite- 
ness problems. 

Proof. Since there is only one state, such a CPM can also be interpreted as an 
EON system (context-free normal system) with productions of the form x — >■ A, 
X — >■ 2/, or X — >■ yz. Moreover, these systems are deterministic since there is only 
one productions for one symbol. For languages generated by such systems the 
following inclusions hold 0 : 

DON C EDON C EDTOL , ON C EON C ETOL. 

Therefore, the problems mentioned above, are decidable. 

□ 

In the previous theorem ON denotes the class of languages generated by 
context-free Post Normal systems, i.e. rewriting systems with productions of the 
form au — >■ ua. Such systems are called deterministic, denoted by DON, if for 
each symbol a there is at most one production. The letter E denotes the fact 
that the generated language consists only of words from a subalphabet of termi- 
nal symbols. Finally, EDTOL and ETOL denote the classes of (deterministic) 
context-free Lindenmayer systems with tables and subalphabet of terminal sym- 
bols |2D1. 
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Lemma 1. Each Circular Post machine can be simulated by another one with 
the cireular tape always of length > 2. 

Proof. Encode any w G S* by AAw with A ^ E, and add the instructions 
pA Ap. 



In P was shown 

Proposition 1. For any CPMi (i = 2,3,4) there exists an equivalent CPMi 
with 2 states (excluding the halting state). 



Here we prove it also for z = 0, 1. 

Theorem 3. For any CPMi (i = Q,l) there exists a simulating CPMi with 2 
states. 

Proof. By Lemma 1 it suffices to consider CPMi’s with the tape always of length 
> 2. Only the third form of instruction has to be considered. 

Simulation of pO — ?> t/qO. 

Length 2 : 



□ 



□ 




Length 3 : 



AoiQi ^ A0Q2i ^ lAoQQ ^ ^loQQ ^ AoiQQ ^ 

^ (D ^ ^ 1^0® 0 ^ ^10® 0 ^ 

^ ^OsiQ) ^ 2A6 sQ) ^ (t)idsQ) ^ (^)disQ) ^ 

(o)o^iQ (A)2(o)6^^ 

If ^ ^ ^ ^ <f^'l20s ^ 





Length > 3 : as for length 3. 



Instructions used 
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Is — si 


2s s2 


Is — >■ si 


2s (o)l 


lA A1 


2^^(o)l 


10 ^ 01 


20^(o)l 


K) ^ 


20^(,+i)l 0<y<n 


K) ^ (/r)2 


2Q ^ (°)2 


i(;) ^ (;)2 




10^^20 


2(o)^01 


10 ^ 01 


20^(o)l 


iQ ^ (/i)2 
l(p) ^ (p2 


Hi) ^ 


l(o) 




^ (,!i) 





Simulation of pa; — j/qO. 

Replace (°) by (“), (°) by (“), A by X, (^) by (^), as well as l(°) ^ ^2(°) 
by l(o) -t ^2(°), and l(p) -)> (^)2 by l(^) -)> (^2 in the previous part. 

□ 



For completeness we also mention P : 

Proposition 2. To every CPMi (i = 0, l,2,3,4j there exists an equivalent 
CPMi with 2 symbols. 



□ 



4 Universal Circular Post Machines 

Here we present a number of new universal circular Post machines of variant 0. 
In the tables yq stands for pa; — >■ yq, y for pa; — >■ yp, q for pa; — q, yqa: for 
pa; — >■ yqa;, and H for the unique halting state. 

UPM0(4,18) 





Ice 


b 


a g d h f X 


y p r s 


t 


11 


z 


q 


1 


2 3 


b 


a g d 


p4y p r s 


t 


u 


q 


q 


2 


Ice 


hi f xl H h f X 


r4y p r s 


t 


u 


d 


q 


3 


3 4 


b 


a g dl b a g 


e I c a 


b 


g 






4 


pry 


t 


s u z h f X 


y p r fl h2 d2 g3 d3 
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Ni = 1, Nk+i = Nk + mk {1 < k < n), iV„+i = + m„ - 1 

blank : y 

Encoding of symbol Si : c 

Encoding of of o;„ : a^^^b- ■ ■ 

Separators : e, d 

The initial configuration is 

e&a^“6- • • • ■■ga^"^b-- ■ • • c/^»c . 

In the first stage is read, altogether Nr b’s and g’s are changed into h’s 
and x’s, respectively, the a’s in between into /’s, and I^''cl^‘c is erased, giving 
ehf^^^h- ■ ■ • • • xa^'~^b- ■ ■ ba^’~'^(’'^ g ■ ■ ■ • • • cl^'“c. 

In the second stage, starting with 4, I^*c- ■ ■ cl^'“c becomes p^*r ■ ■ ■ rp^'“r, 
e is changed into y, d into z, a^’'^ b - ■ ■ g into t - ■ ■ u and copied 

as p^’~^r ■ ■ ■ at the end of p^*r ■ ■ ■ rp^'“r. 

In the third stage, starting with 4, both parts of the tape are restored, and 
a new cycle may start. 

The machine stops if in the first stage 2 encounters d. 

□ 



UPM0(5,11) 



Ni = 1, Nk+i = Nk + nik {1 < k <n) 
blank : y 

Encoding of symbol Si : /^*c 
Encoding of at : a^'^b- ■ ■ ba^*'^‘-*^g 
Separators : e, d 



The initial configuration is 

6 • • • y- ga^"^ 5 • • • gdll ^- c • • • c/^“ c . 

In the first stage is read, altogether Nr b’s and g’s are changed into h’s 
and x’s, respectively, the a’s in between into f’s, and I^''cl^‘c is erased, giving 
ehf^^’^h- ■ ■ ft,/^i"»(i)x • • • xa^'~^b- ■ ■ ba^’~'^(’'^ g ■ ■ ■ ba^"”'(’''i d4I^* c ■ ■ ■ cl^'“c. 

In the second stage, starting with 4, c • • • c/^“ c becomes /^‘ h - ■ ■ hf^'" h, 
e is changed into y, - ■ ba^’’’^'-^^ g into - and copied as 

fNri h. . . hf^r-m,(,r ) /j ^t the eud of h - ■ ■ hf^“ h. 

In the third stage, starting with 3, both parts of the tape are restored, and 
a new cycle may start. 

The machine stops if in the first stage 4 encounters d. 

□ 





Iceb a g dhfx y 


T 


23 bagd2cl c3y 


2 


I c e h3 f x3 h f h4y 


3 


34 bagdlbage 


4 


f h y hi f5 g3 H h f X y 


5 


bag d h f f4y 
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UPM0(6,8) 





I C 


e 


b 


a 


d h f 


T 


2 3 c3e b3 a6 


c I 


2 


I c 


e 


h3 


f 


h f 


3 


3 4 


e 


b 


a 


dl b a 


4 


f h 


e 


hi f5 


H h f 


5 




f4e 


b 


a 


d h f 


6 




h4e 


b 


a 


d h f 



The initial configuration is 



Ni = 1, Nk+i = Nk + nik + l {I < k <n) 

blank : e 

Encoding of symbol Si : I^*c 
Encoding of at : 

Separators : e, d 



6 • • • bb--- bba^"^ 6 • • • bbdll^- c • • • c/^» c . 

In the first stage is read, Nj. b’s are changed into hs, the as in between 
into f’s, and I^''cl^‘c is erased, giving 

ehf^^^h ■ ■ ■ hh--- hha^-^ 6 • • • ba^-’^(’-) g--- d4/^* c • • • c/^“ c. 

In the second stage, starting with 4, c • • • c/^“ c becomes /^‘ h - ■ ■ hf^'“ h, 
the part b - ■ ■ bb is changed into h--- hb and copied as 

/i at the end of f^*h- ■ ■ hf^“h. 

In the third stage, starting with 3, both parts of the tape are restored, and 
a new cycle may start. 

The machine stops if in the first stage 4 encounters d. 

□ 



UPM0(9,5) 



Ni = 1, Nk+i = Nk + 2mk - 1 {I < k <n) 
blank : d 

Encoding of symbol Si : I^'c 
Encoding of : a^*^bb- ■ ■ 

Separators : d, bbc 





I 


c 


a 


b 


d 


T 


2 


4 


a7 


b4 




2 


I 


c 


I 


c3 


d 


3 




cl 


a 


b 




4 


4 


5 


00 


H 




5 


I 


c 


16 


cl 


d 


6 


I 


c 


a 


b 


I5d 


7 


I 


c 


a 


b 


c5d 


8 


I 


c 


a 


b 


c9d 


9 


a 


b 


a 


b3 


d 



The initial configuration is 



c?6o^“56 • • • bba^^"‘(^^ b ■ ■ ■ ba^"^^bb ■ ■ ■ • • • cl^'“c . 

In the first stage is read. Nr b’s are changed into c’s, the a’s in between 
into /’s, and is erased, giving 

dcl^^^cc ■ ■ ■ c • • • bb - • ■ b - • • bba^’'"' 



(”) bbbcbl^* c • • • c/'^” c. 
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In the second stage, starting with 5, the part a^''^bb- ■ ■ bba^'~^‘-'~'> b becomes 
and is copied behind I^*c- ■ ■ as 

In the third stage, starting with 9, the instruction part of the tape is restored, 
and a new cycle may start. 

The machine stops if in the first stage 5 encounters bbb. 

□ 



UPM0(12,4) 



Ni = I, Nk+i = Nk + nik + 1 {I < k <n) 
blank : a 

Encoding of symbol Si : I^'c 
Encoding of c • • • cc 

Separators : o, bcb 





I 


c 


a 


b 


T 


2 


5 






2 


I 


c 


a3 


bl 


3 


b 


a4 


a 


b 


4 


I 


c 




b2 


5 


5 


6 






6 


I 


c 


a7 


bA 


7 


b8 a9 


a 


b 


8 


I 


c 


I6a 


b 


9 


10 cA aB 


H 


0 


I 


c 


c6a 


b 


A 


I 


c 


c9a b6 


B 




c4 


c 


I 



The initial configuration is 



acl^^^c- ■ ■ • • cc/'^"^c- • • ccbcbll^’' c ■ ■ ■ cl^"c . 



In the first stage is read, c’s are changed into a’s, the /’s in between 
into 5’s, and is erased, giving 

aab^^^a- ■ ■ • • • aal^’’^c- ■ ■ cc ■ ■ ■ ccbcb5I^* c ■ ■ ■ cl^'^c. 

In the second stage, starting with 6, the part c • • • cc becomes 

a’^r-ib- ■ ■ ba^’~'^(’'^ ac, and is copied behind I^*c- ■ ■ I^'^c as I^''^c- ■ ■ cl^’’"‘(''^ c. 

In the third stage, starting with 9, the instruction part of the tape is restored, 
and a new cycle may start. 

The machine stops if in the first stage 7 encounters bcb. 

□ 
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UPM0(18,3) 





I 


c 


a 


T 


2 


7 




2 


I 


c 


a3 


3 


H 


c4 


I 


4 


16 


c5 


I 


5 


11 


c 


a 


6 


IC 


c3 


ID 


7 


7 


8 


aK 


8 


I 


c 


a9 


9 




c6 


10 


0 


lA 


c 


a 


A 


I 


c 


I8a 


B 


IE 


c 


a 


C 


I 


c9 




D 




cB 


aF 


E 


I 


c 


c8a 


F 


IG 


c 


a 


G 


I 


c 


c7a 


K 


a 


c 


a5 



Ni = 1, Nk+i = Nk + nik {l< k<n) 
blank : a 

Encoding of symbol Si : I^'c 
Encoding of ai : I^*^cac- ■ ■ caac 

Separators : a, cc 



The initial configuration is 

acaca^^^ cac ■ ■ ■ caca^^"'<^'> caac ■ ■ ■ caaca^"^ cac ■ ■ ■ caaccclll^^ c 

■ ■ ■ c . 

In the first stage is read, altogether Nr cac’s and caac's are changed 
into c/c’s and cllc’s, respectively, the J’s in between into a’s, and I^''cl^‘c is 
erased, giving 

aclcl^^^clc- ■ ■ del die- ■ ■ dlca^''^cac- ■ ■ caca'^’’"*(’')caac- • • 

■ ■ ■ cac^"™(")caaccc/8/^*c • • • d^'“c. 

In the second stage, starting with 8, the part cac ■ ■ ■ caca^'''^^’'daac is 
changed into I^''^dc- ■ ■ c/c/^’’’"(’')c/ac, and copied to the end of I^*c- ■ ■ I^'“c 

as c • • • c. 

In the third stage, starting with 7, the instruction part of the tape is restored, 
and a new cycle may start. 

The machine stops if in the second stage 9 encounters cd. 

□ 
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Abstract. We present three results on divisibility monoids. These di- 
visibility monoids were introduced in mi as an algebraic generalization 
of Mazurkiewicz trace monoids. (1) We give a decidable class of presen- 
tations that gives rise precisely to all divisibility monoids. (2) We show 
that any divisibility monoid is an automatic monoid |^. This implies 
that its word problem is solvable in quadratic time. (3) We investigate 
when a divisibility monoid satisfies Kleene’s Theorem. It turns out that 
this is the case iff the divisibility monoid is a rational monoid iff it 
is width-bounded. The two latter results rest on a normal form for the 
elements of a divisibility monoid that generalizes the Foata normal form 
known from the theory of Mazurkiewicz traces. 



1 Introduction 

Different mathematical structures have been proposed to model the behavior of 
concurrent systems, among them process algebras, sets of partially ordered sets, 
Petri nets etc. One particular approach in this line of research is that introduced 
by Mazurkiewicz El, now known as trace monoids. Since Mazurkiewicz ’s ob- 
servation that trace monoids can be used to model the behavior of 1-safe Petri 
nets, much research has dealt with the topic, see 0 for a collection of surveys. 
Despite their success, certain limitations of trace monoids have been observed. 
Therefore, several generalizations were considered. One of these generalizations 
are divisibility monoids cal] In this paper, we describe the relation to other 
classes of monoids known in theoretical computer science, namely to automatic 
El, rational ca and Kleene monoids. As corollaries, we obtain a quadratic 
lower bound for the complexity of the word problem and a characterization of 
those divisibility monoids that satisfy Kleene’s theorem. 

Mazurkiewicz traces model the sequential behavior of a parallel system in 
which the order of two independent actions is regarded as irrelevant. One con- 
siders pairs (S,I) where E is the set of actions, and J is a symmetric and 

^ Similar monoids have been considered in |7ltH where they are related to braid and 
other groups traditionally of interest in mathematics. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 227 - 17 ^ 2001. 
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irreflexive binary relation on S describing the independence of two actions. The 
trace monoid or free partially commutative monoid M(27, 1) is then defined as 
the quotient where ~ is the congruence on the free monoid S* generated 

by all equations ab ~ ba with (a, b) S I. Thus, originally, trace monoids are 
defined by a decidable class of presentations. An algebraic characterization of 
trace monoids was given only later by Duboc m- Divisibility monoids are a 
lattice theoretically easy generalization of these algebraic conditions. Our first 
result (Theorem ^ describes a decidable class of presentations that give rise 
precisely to all divisibility monoids. Since the canonical presentations for trace 
monoids belong to this class, our result can be seen as an extension of Duboc’s 
characterization to the realm of divisibility monoids. 

For trace monoids, the word problem can be solved in linear time |H|. From 
our presentation result, an exponential algorithm for the word problem in divis- 
ibility monoids follows immediately. But we show that one can do much better: 
The work on automatic groups HS| has been generalized to the realm of semi- 
groups. Intuitively, a semigroup is automatic if it admits a presentation such 
that the equality can be decided by an automaton and such that the multipli- 
cation by generators can be performed by an automaton (rzE|. In particular, 
Campbell et al. jn| showed that the word problem for any automatic semigroup 
is solvable in quadratic time. Theorem El shows that any divisibility monoid is 
an automatic semigroup. Hence, we can infer from the result of Campbell et al. 
that the word problem for any divisibility monoid can be solved in quadratic 
time. We do not know whether this result can be improved, but we have serious 
doubts that a linear time algorithm exists. Kleene m showed that in a free 
finitely generated monoid the recognizable languages are precisely the rational 
ones. It is known that in general this is false, but Kleene’s result was general- 
ized in several directions, e.g. to formal power series by Schiitzenberger m , to 
infinite words by Biichi 0, and to rational monoids by Sakarovitch j^. In all 
these cases, the notions of recognizability and of rationality were shown to coin- 
cide. This is not the case in trace monoids any more. Even worse, in any trace 
monoid (which is not a free monoid), there exist rational languages that are not 
recognizable. But a precise description of the recognizable languages in trace 
monoids using c-rational expressions could be given by Ochmahski [22j . A fur- 
ther generalization of Kleene’s and Ochmahski’s results to concurrency monoids 
was given in m- The proofs by Ochmahski as well as by Droste heavily used 
the internal structure of the elements of the corresponding monoid. The original 
motivation for the consideration of divisibility monoids in PEI was the search for 
an algebraic version of these proofs. We succeeded showing that in a divisibility 
monoid with finitely many residuum functions, the recognizable languages coin- 
cide with the (m)c-rational ones (cf. for precise definitions of these terms). 
Thus, two main directions of generalization of Kleene’s Theorem in monoids are 
represented by Sakarovitch’s rational monoids and by trace monoids. Since the 
only trace monoids that satisfy Kleene’s Theorem are free monoids, these two 
directions are “orthogonal”, i.e. the intersection of the classes of monoids in con- 
sideration is the set of free monoids. In ini we already remarked that there are 
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divisibility monoids that satisfy Kleene’s Theorem and are not free. Thus, our 
further extension of Ochmahski’s result to divisibility monoids m is not “or- 
thogonal” any more. In this paper, we describe the class of divisibility monoids 
that satisfy Kleene’s Theorem. Essentially, Theorem 0 says that a divisibility 
monoid satisfies Kleene’s Theorem if and only if it is rational if and only if it 
is width-bounded. Thus, in the context of divisibility monoids, the classes of 
Kleene monoids and rational monoids coincide which is not the case in general 
and we give an internal characterization of these monoids. 

Our proofs that any divisibility monoid is automatic as well as the proof that 
any divisibility monoid satisfying Kleene’s Theorem is rational, use a normal 
form for the elements of a divisibility monoid. This normal form generalizes the 
Foata normal form from trace theory. It is studied in Section 0 Furthermore, 
we rely on the results by Campbell et al. from on automatic semigroups, by 
Sakarovitch from m on rational monoids, on basic properties of distributive 
lattices that can be found in |2] and on Ramsey’s Theorem |^. 

2 Basic Definitions 

2.1 Order and Monoid Theoretic Definitions 

Let (P, <) be a partially ordered set and y € P. By ly, we denote the principal 
ideal generated by y, i.e. the set {x ^ P \ x < y}. For x,y G P, we write x — < y 
if X < y and there is no element properly between x and y. A set A C P is 
an antichain if any two distinct elements of A are incomparable. The width of 
a partially ordered set (P, <) is the supremum over all natural numbers n such 
that there exists an antichain A C P with n = |A|. The width of P is denoted by 
w{P, <). If the width of (P, <) is finite, any antichain in P has at most w{P, <) 
elements. If the width is infinite, (P, <) contains finite antichains of arbitrary 
size. In particular, the width of a finite partially ordered set is always finite. 
A chain is a set C C P whose elements are mutually comparable. For x G P, 
the height h{x) in the poset (P, <) is the maximal size of a chain all of whose 
elements are properly below x. The length of the poset (P, <) is the maximal 
height of its elements. 

A lattice is a partially ordered set (P, <) where any two elements x,y G P 
admit a supremum x\/ y and an infimum x A y, i.e. a least upper and a largest 
lower bound. The lattice (P, <) is distributive if for any x,y,z G P we have 
X V (y A z) = (x V y) A (x V z). For many results concerning lattices see |2|. In 
particular, any two maximal chains in a finite distributive lattice have the same 
size. 

A triple (M, •, 1) is a monoid if M is a set, • : M x M — >■ M is an associative 
operation and 1 G M is the unit element satisfying 1 • x = x • 1 = x for any 
X G M. Let (M, -,1) be a monoid and X C M. Then, by {X) we denote the 
submonoid of M generated by X, i.e. the intersection of all submonoids of M 
that contain X. If {X) = M, A is a set of generators of M. The monoid M is 
finitely generated if it has a finite set of generators. Let A be a set. Then X* 
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denotes the set of all words over X. With the usual concatenation of words and 
the empty word as unit element, this becomes a free monoid generated by X. 

A subset L of a monoid M is rational if it can be constructed from the fi- 
nite subsets of M by the operations concatenation •, union U and iteration (.) 
(also known as Kleene-star). A set L C M is recognizable iff there exists a finite 
monoid {S, ■, 1) and a homomorphism rj : M ^ S such that L — rj~^r]{L). Rec- 
ognizable sets are sometimes called recognizable languages. In general, the sets 
of recognizable and of rational subsets of a monoid are different and even incom- 
parable. If the notions of recognizability and rationality coincide in a monoid 
M, then the monoid M is said to be a Kleene monoid. Kleene showed that this 
holds in free finitely generated monoids: 

Kleene’s Theorem [18J . Let X be a finite set. Then a set L C X* is rational 
iff it is recognizable. 

Let A be a set and f3 : X* X* a function which is not necessarily an 
homomorphism. Let furthermore M he & monoid. The function /3 is a normal 
form function for M if it is idempotent, the kernel ker(/3) = {(u, w) G X* x X* \ 
(3{v) = j3{w)} is a monoid congruence, and A*/ker(/3) = M. A monoid M 
is rational |25| if there exist a finite alphabet X and a normal form function 
j3 : X* — >• X* for M such that {(u,/3(u)) | v G X*} is a rational subset of the 
monoid X* x A*. 

In f25l23| , the authors are particularly interested in closure properties of the 
class of rational monoids. Sakarovitch Theorem 4.1] also shows that any 
rational monoid is a Kleene monoid (there are Kleene monoids which are not 
rational, see for an example). 

2.2 Divisibility Monoids 

Let M = {M, •, 1) be a monoid. We call M cancellative ii x ■ y ■ z = x ■ y' ■ z 
implies y = y' for any x,y,y',z G M. This in particular ensures that M does 
not contain a zero element (i.e. an element z such that z ■ x = x ■ z = z for any 
X G M). For x,z G M, x is a left divisor of z (denoted a: < z) if there is y G M 
such that X ■ y = z. In general, the relation < is not antisymmetric, but reflexive 
and transitive, i.e., a preorder. 

Lemma 1. Let (M, •, 1) be a cancellative monoid and a G M . Then the mapping 
a : {M, <) — >■ (a • M, <) defined by a(x) := a ■ x is a preorder isomorphism. 

Let A := (M \ {!}) \ (M \ {1})^. The set S consists of those elements of M 
that do not have a proper divisor, its elements are called irreducible. Note that 
A is contained in any set generating M . 

Definition 1. A monoid (M, -,1) is called a left divisibility monoid provided 
the following hold 

1. M is cancellative and its irreducible elements form a finite set of generators 

ofM, 
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2. X Ay exists for any x,y € M , and 

3. (fx, <) is a finite distributive lattice for any x € M . 

A left divisibility monoid is width-bounded if there exists a natural number n £ N 
such thatw{\.x,<) < n for any x £ M, i.e. if the widths of the distributive lattices 
fx are uniformly bounded. 

Note that by the third axiom the prefix relation in a left divisibility monoid 
is a partial order relation. Since, by Lemma y < z implies x ■ y < x ■ z, a, left 
divisibility monoid is a left ordered monoid. Ordered monoids where the order 
relation is the intersection of the prefix and the suffix relation were investigated 
e.g. in PI under the name “divisibility monoid”. Despite that we require more 
than just the fact that (M, •,<) be a left ordered monoid, this might explain 
why we call the monoids defined above “left divisibility monoid” . Since Birkhoff ’s 
divisibility monoids will not appear in our investigations any more, we will simply 
speak of “divisibility monoids” as an abbreviation for “left divisibility monoid” . 
“Divisibility semigroups” are investigated in several papers by Bosbach, e.g. |2|. 
Despite the similarity of the name, we baptized our monoids independently and 
there seems to be no intimate relation between Bosbach’s divisibility semigroups 
and our divisibility monoids. 

Let (M, - ,1) be a divisibility monoid and let x,y G M with x ■ y = 1. Then 
1 < X < 1 implies x = 1 since by the third axiom < is a partial order. Hence we 
have y = X ■ y = 1, i.e. there are no proper divisors of the unit element. 

By the second requirement on divisibility monoids, the partial order (M, <) 
can be seen as the set of compacts of a Scott-domain. This in particular ensures 
that any set ACM that is bounded above in (M, <) has a supremum in 
this partial order. Since, in addition, any element of M dominates a finit^ 
distributive lattice, (M,<) is even the set of compacts of a dl-domain (cf. P 
Ej)- Thus, we have in particular (x V y) A z = (x A z) V (y A z) whenever the left 
hand side is defined. 

Example 1. Using standard results from trace theory nm . it is easily seen that 
any (finitely generated) trace monoid is a divisibility monoid. Now let E = 
{a, 5, c, d} be an alphabet. Let be the least congruence on the free monoid 
E* that identifies the words ab and cd. In a trace monoid, the equality ab = cd 
implies {a, b} = {c, d} for any generators a, b, c, d. Hence the quotient monoid 
E* is not a trace monoid. But, as we will see later, it is a divisibility monoid. 
Similarly, let identify aa and bb. Again, E* is no trace but a divisibility 
monoid. Finally, identifying aa and be again results in a divisibility monoid as 
Theorem n below shows. 

Since a divisibility monoid (M, - ,1) is generated by the set E of its irreducible 
elements, there is a natural epimorphism nat : E* — >■ M. Let |x| denote the 
length of the lattice fx which equals the size of any maximal chain deduced by 

^ We just remark that the requirement (in the dehnition of a divisibility monoid) on 
the lattices fx to be finite is not really necessary since it already follows from the 
other stipulations m- 
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1. It is easily checked that x — < y iff there exists a G S with x ■ nat(a) = y for 
any x^y G M . Hence the maximal chains in ].x correspond to the words w G S* 
with nat(w) = x. This implies that any two such words have the same length 
which equals |a;|. 

Since nat is an epimorphism, there is a congruence ~ on the free monoid 
S* such that is isomorphic to the divisibility monoid (M, •). Hence, we 

can reformulate the stipulations in Definition ^ into requirements on the con- 
gruence E.g. the property of M to be cancellative would be reformulated to 
“for any words u,v,w G S* with uv ^ uw or vu ^ wu, we get v ^ w” . Although 
such a reformulation might look more effective than the original definition, it is 
not finite since it makes statements on words of arbitrary length. We now show 
that there is a decidable class of finite representations that gives rise precisely 
to divisibility monoids. 

In [El Lemma 3.4], we showed that the congruence ^ is generated by equa- 
tions of the from ab ~ cd for a, b,c,d G S. So let A be a finite set and let ~ be a 
congruence on the free monoid E* that is generated by all equivalences ab ^ cd 
for a,b,c,d G E. We aim at a characterization of the fact that M = E* is 
a divisibility monoid. In this monoid, the elements from E (more precisely, the 
equivalence classes [a] for a G E) are the irreducible elements since ~ is length 
preserving. Hence M is finitely generated by its irreducible elements. To ensure 
that M is cancellative, we need at least that the following holds for any elements 
a, b, c, 5', d G E: 



abc ^ ab' c' or bca ~ b'c'a implies be ^ b'c'. (2-1) 

Note that (EH) requires the cancellation for words of length 3, only. In the same 
spirit, we now weaken the second requirement concerning the existence of infima: 
Suppose 6 yf c, but ab ^ a'b' and ac ~ a'd for some a, b, c, o', 6', d G E. Then one 
can infer from EH) that ab / ac. Since ~ is length preserving, [a] and [o'] are 
maximal lower bounds of [a6] and [ac] . Since by the second axiom of divisibility 
monoids infima of any two elements exist, we obtain a = a' . Thus, the following 
requirement is a weakening of the above mentioned second axiom to words of 
length 2: 

ab ^ a'b', ac ^ a'd and 6 yf c imply a = a' (2-2) 

for any letters a,b,c,a',b',d G E. The third axiom on divisibility monoids is 
restricted verbatim to words of length 3: 

(|([a6cj), <) is a distributive lattice (2.3) 

for any letters a,b,c G E. The following theorem states that the three properties 
we identified are sufficient to characterize all divisibility monoids: 

Theorem 1. Let E be a finite set and E a set of equations of the form ab ^ cd 
with a,b,c,d G E. Let ~ be the least eongruenee on E* eontaining E. Then 
E* is a divisibility monoid if and only if iH. 1\) . lU.m) and iit. hold for any 
a,b,c,a',b',d G E. Conversely, eaeh divisibility monoid arises this way. 
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We indicated that indeed any divisibility monoid arises this way. For the 
first statement let R = {ab — >■ cd \ (ab,cd) G E} be the (symmetric) semi Thue 
system associated with the set of equations E. Any two i?-equivalent words can 
be transformed into each other by at most one application of a rule from R at 
the first position (this statement is proved using deep results from the theory of 
semimodular and of distributive lattices [2). From this property of R, one can 
then infer that E* is a divisibility monoid (cf. ^1 Chapter 8] or |2D]). 

Note that for a finite set of equations E of the form ab ^ cd, it can be 
checked effectively whether the three properties ll 2 . 1 1 . liz.'zi) and (E3» are satisfied 
by the least congruence containing E. Hence Theorem Q] describes a decidable 
class of finite presentations that give rise to the class of all divisibility monoids. 
Christian Pech of Dresden using the GAP4-system m computed that there are 
219 divisibility monoids with 4 generators and 8371 divisibility monoids with 5 
generators as opposed to only 10 resp. 34 trace monoids. 



3 A Foata Normal Form 

Throughout this section, let (M, -,1) be a fixed divisibility monoid and let E 
denote the set of its irreducible elements. In this section, we will define a Foata 
normal form for elements of M that generalizes the known Foata normal forms 
from the theory of Mazurkiewicz traces. These Foata normal forms are the basis 
for our proofs in the following sections that any divisibility monoid is automatic 
and that a width-bounded divisibility monoid is rational. We define the set of 
cliques to consist of all nonempty subsets of E that are bounded above, i.e., 
®={ACZ'|0y^A and sup(A) exists} 

Next we define the set FNF C 6f* of Foata normal forms as 

{A 1 A 2 . . . A„ S 6f* I Vt S Ai+iVH G on : supB ^ (sup A^) • t for 1 < i < n}. 

Since the condition that constitutes membership in FNF is local, FNF is a ra- 
tional language in the free monoid &* . Let a' : & ^ M denote the mapping 
that associates with any clique A G its supremum sup A in M . This map- 
ping extends uniquely to a monoid homomorphism a from CX* to M. Then 
a(AiA 2 . . .An) = (sup Ai) • (sup A 2 ) • • • (supA„). This mapping is not injective, 
but surjective since o:({ai}{a 2 } . . . {on}) = oi • U 2 • • • for any Ui G E and E 
generates M. The set FNF is particularly useful since it provides normal forms 
for the elements of M, i.e. since the restriction of a to FNF is a bijection: 

Lemma 2. The mapping a ( FNF : FNF M is bijective. 

Proof. To show injectivity, one proves for any A 1 A 2 . . . A„ G FNF that Ai is the 
set of irreducible elements a G E that divide a(Ai A 2 . . . A„) and continues by 
induction. 

To show surjectivity, one builds the Foata Normal Form of x inductively by 
setting X\ = X, Ai = {a G E \ a < Xi}, and a(Ai) ■ Xi+i = xi. □ 
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Thus, for any x G M, the set a~^{x) fl FNF is a singleton. We denote the 
unique preimage of x in FNF by fnf(cc) and call it the Foata Normal Form of x. 

Now let P : Gt* — >■ Gi* be defined by /3 = fnfoa, i.e. PiW) is the Foata normal 
form of the element a{W) for any word W over the alphabet 6f. This function is 
idempotent. Since fnf is injective, we obtain ker(a) = ker(/3). Since a is a monoid 
homomorphism, ker(/3) is a congruence. Finally Cf*/ker(/3) = 6f*/ker(a) = M 
holds since a is surjective. Thus, we obtain 

Lemma 3. The function fnf o a : G£* — >■ G£* is a normal form function for the 
divisibility monoid M . 

Next we show that the Foata Normal Form of nat(w) can be computed from 
the word w G S* by an automaton. In general, this automaton has infinitely 
many states, but for width-bounded divisibility monoids its accessible part will 
be shown to be finite. 

An automaton over the monoid S' is a tuple A = {Q, S, E, /, F) where 

1. (5 is a set of states, 

2. A C Q X S X Q is a set of transitions, and 

3. I,F C Q are the sets of initial and final states, respectively. 

The automaton A is finite if E is. A computation in A is a finite sequence of 
transitions: 

Q.T <2o ^71 

Po Pi A P2 >Pn- 

It is successful ifpo € I and p„ G E. The label of the computation is the element 
oi • 02 • • • a„ of the monoid S. For a computation with first state po, last state 
and label a, we will usually write po A- without mentioning the intermediate 
states. The behavior of A is the subset \A\ of S consisting of labels of successful 
computations in A. 

Lemma 4. There exists an automaton Am with state set M x (6f U {e}) over 

the monoid E* x Q?* that has a computation (1,£) ^ (z,C) iff 

Bra = C, fnf(nat(i(;) • z) = B 1 B 2 ■ ■ ■ Bm, and |fnf o nat(w)| = m. 

for any w G E* , Bi £ Gi for 1 <i <m, z £ M and C £ GE^ = GiU {e}. 

Proof. For {x,A), (z,C) £ M x GEe and (a,B) £ E x GE^, there is a transition 
(x. A) ^ (z, C) iff 

1. a < X, B = e, a ■ z = X, and C = A, or 

2. a ^ X, B = C jE e, AB £ FNF, and a - z = x ■ (sup B). □ 

We can think of (1, e) gg denoting the fact that, on input 

of the word w £ E* , the automaton outputs B 1 B 2 ■ ■ ■ Bm and reaches the state 
(z,C). Thus, by the lemma above, the automaton stores the last letter of its 
output Bm from GE in the second component C of the state reached. Further- 
more, it outputs the Foata normal form of some element nat(w) ■ z of M that 
is an extension of the input. The “difference” between the input nat(w) and the 
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output a{BiB2 ■ ■ ■ Bm) (seen as elements of M) equals z and is stored in the 
first component of the reached state. The last property mentioned in the lemma 
above ensures that the Foata normal forms of nat(ic) and of nat(w) • z have 
the same length m. Intuitively, the difference between the input nat(rc) and the 
output a{BiB2 ■ ■ ■ Bm) is not too “large”. 

Note that the automaton outputs only elements of FNF. Even more, if z = 1 
the automaton outputs the Foata normal form of the input. Let ( 1 ,£) be the 
only initial state and let the set of final states be { 1 } x Then the behavior 
of the automaton Am is (the graph of) the function fnf o nat : E* — >■ &*. 

4 Divisibility Monoids Are Automatic ~ The Word 
Problem 

Since the equations that define a divisibility monoid are length preserving, the 
word problem for any divisibility monoid is decidable. In this section, we show 
that it can be solved in quadratic time. 

For an alphabet E, let Eg = T’U{£} and E2 = Eg x Eg \ {(£,e)}. Let 
furthermore v = S1S2 ■ ■ ■ Sm and w = tit2 . . . be words over E with Si, tj G E. 
We define 



{ (si,ti)(s 2 ,t 2 ) ■ • ■ (Smtn)(£5 ^n-\-l) (^^<^^71+2) ■ • ■ (£; ^m) if Tl TTl 

(si,fi)(s2,t2) ■ • • (s„,t„) if n = 771 

(si,ti)(s 2 ,t 2 ) ■ • ■ (Sm: tm) (Sm+l ; £) (Sm+ 2 ; £) ■ ■ • (Sn 5 £) if 77 > 777 

Then (v,w)^ is an element of the free monoid E^. Now let M be a monoid, 

E a finite set, L C E* a recognizable language in the free monoid E* and 
rj : E* — > M a homomorphism. Then {E, L, rj) is an automatic structure for M 
0 if v{L) = M and if the sets 

L= = {(77,77;)* I 77,77; G L and 77(77) = 77(777)} and 
La = {('i^jW;)* I 77,777 G L and 77(770) = 77(777)} 

are rational subsets of for any a G E. The monoid M is an automatic monoid 
if it admits an automatic structure. 

We will show that any divisibility monoid is automatic. More precisely, let 
M be a divisibility monoid. For A G choose a word 777^ G E* with nat (777 a) = 
a{A). Then the mapping Qt — >■ E* : A 1— >■ wa admits a unique extension to a 
monoid homomorphism ip : — >■ E*. Let L = 7/; (FNF) denote all words over 

E of the form ilj{Ai)ijj{A2) . . . ip{An) for some cliques Ai such that A1A2 . . . 
is a Foata normal form. We are going to prove that the triple {E,L, nat) is an 
automatic structure for the divisibility monoid M : 

Note that a = nat 07/1. Since, by LemmaOl the mapping a \ FNF is surjective, 
we obtain M = a(FNF) = nato7/;(FNF) = nat(L). 

Next, we show that is rational. Recall that FNF C Cf* is rational. Hence 
its image L with respect to the homomorphism 7/7 is rational, too. This implies 
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immediately that the set {(■(;,?;)* | u G L} is rational in Now let v^w & L 
with nat(f ) = nat(w). Then v = tp o fnf o nat(f ) = tp o fnf o nat(w) = w. Hence 
we showed L- = {(?;,?;)* \ v G L} which is rational in 

It remains to show that for a G E the set is rational in Elp. Note that 
La = {(?^,'0(fiifonat(r;a))) | v G L}. To show that this set is rational, we have to 
construct an automaton that outputs o nat(ua)) on input of v for v G L. 

Since this can indeed be achieved (cf. [201 )? we obtain 

Theorem 2. Let M he a divisibility monoid. Then M is an automatic monoid. 

Using that the word problem for automatic monoids can be solved in 
quadratic time jS], we obtain immediately 

Corollary 1. Let M he a divisibility monoid. Then the word problem for M can 
be solved in quadratic time. 



5 When Does Kleene’s Theorem Hold? 

As mentioned in the introduction, Kleene’s Theorem holds in a divisibility 
monoid M iff it is width-bounded iff it is rational. To obtain this result, we 
first sketch the proof that any width-bounded divisibility monoid is rational. 
The crucial step in the proof is expressed by the following lemma 

Lemma 5. Let (M, •, 1) be a width-bounded divisibility monoid and n G N such 
that w(4,a:,<) < n for any x G M. Let x,z G M with |fnf(a; 2 :)| = |fnf(x)|. Then 
\z\ < 2(n -I- 1)|A’|. 

Using Lemma 0 we obtain that the automaton Am has only finitely many 
reachable states for M width-bounded, i.e., the reachable part A of the automa- 
ton Am is a finite automaton. Note that the behavior of A and that of Am 
coincide. Thus, by d, the behavior |A| = \Am\ = {(w,fnf onat('u;)) | w G A*} 
is a rational set in the monoid E* x Gb*. 

We consider again the homomorphism ^p : G£* — >■ E* that we constructed in 
the previous section. Since ip is a homomorphism, the set {{W,ip{W)) \ W G 
is rational in G£* x E* . Furthermore, nato^ = a and therefore fnf o a = 
fnf o nat oip. Hence the function fnf o a is the product of two rational functions 
fnf o nat and ip. This implies by [O] that the set {(IT, fnf o a{W)) \ W G 6f*} 
is rational since E* is a free monoid. Recall that by Lemma 0 the function 
fnf oa is a normal form function for the divisibility monoid M. Hence we showed 
that a width-bounded divisibility monoid indeed admits a rational normal form 
function, i.e. is a rational monoid: 

Proposition 1. Any width-bounded divisibility monoid is rational. 

By E3 Theorem 4.1], this proposition implies that any width-bounded divis- 
ibility monoid is a Kleene monoid. The remainder of this section is devoted to the 
inverse implication: Let M be a divisibility monoid that is not width-bounded. 
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i.e., for any n G N, there is z € M such that the width of the distributive lattice 
4,2 is at least n. This implies, that for any k G N, there exists z G M, such 
that the lattice ({1, 2, 3, , fc}^, <) (with the coordinatewise order) can be em- 
bedded into the lattice 4_z. Let / be such embedding. If the divisibility monoid 
M has finitely many residuum functions (see below), we can apply Ramsey’s 
Theorem PH twice and obtain 1 < z < < n, 1 < j < / < n, and x,y G M 

such that ■ x = f(i',j) ■ x = f(ij) ■ y = and 

fihj') • y = /(*^j0• By cancellation, this implies x ■ y = y ■ x. Furthermore 
X \\ y since / is a lattice embedding and M is cancellative. This was the proof 
of the following lemma that we state after defining residuum functions: Let M 
be a divisibility monoid and x,y G M . We say that x and y are complementary 
(denoted cc || y) if a: A ?/ = 1 and the set {x,y} is bounded above. Hence, two 
nonidentity elements of M are complementary if they are complements in one 
of the lattices with z G M . Since bounded elements have a supremum, there 
exists y' G M with x ■ y' = x \/ y. This element y' is uniquely determined by x 
and y since M is cancellative. We call it the residuum of y after x and denote 
it by rx{y)- Thus, r^ is a partial function from M to M whose domain is the 
set of elements that are complements of x. Let Km denote the set of all partial 
functions r^ for some x G M. The divisibility monoid M is said to have finitely 
many residuum functions if K-m is finite 0 

Lemma 6. Let (M, -,1) be a divisibility monoid with finitely many residuum 
functions that is not width-bounded. Then there exist x,y G M \ {1} such that 
X \\ y and x ■ y = y ■ x. 

Now we can characterize the divisibility monoids that satisfy Kleene’s The- 
orem. 

Theorem 3. Let (M, •, 1) be a divisibility monoid. Then the following assertions 
are equivalent 

1. M is width-bounded, 

2. M is a rational monoid and has finitely many residuum functions, and 

3. M is a Kleene monoid and has finitely many residuum funetions. 

Proof. A width-bounded divisibility monoid is rational by Proposition E Now 
let Si,ti G M for 1 < i < n such that x = si ■ S 2 ■ ■ ■ Sn and y = ti ■ t 2 ■ ■ ■ tn are 
complementary. Then the elements Si • S 2 • • • V ti • ^2 ’ • ’ tn-k for ^ k < n 
form an antichain that is bounded above by z = x\/ y. Hence the lattice fz has 
width at least n — 1. Now let M be width-bounded such that the lattices 4-z have 
width at most n — 2. Then, as we just saw, at most one of two complementary 
elements of M has length at least n. Let denote the finite set of elements of 
M of length at most n. For x G M \ M„, the domain of r^ is therefore contained 
in Mn and, since r^ is length-preserving, so is its image. Hence there are only 
finitely many residuum functions Vx for x G M \ Mn. Since Mn is finite, M 
therefore has only finitely many residuum functions. 

^ It is not known whether this class is a proper subclass of all divisibility monoids. 
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Any rational monoid is a Kleene monoid by Theorem 4.1]. For the re- 
maining implication assume by contradiction M not to be width-bounded. Then, 
by LemmaEl there are x,y € M\{1} such that x-y = y-x and x || y. Hence the 
mapping (0, 1) i— >■ a; and (1,0) i— >■ y can be extended to a monoid embedding from 
(N X N, -I-) into M. The image of {(n, n) | n S N} in M under this embedding is 
a rational set which is not recognizable. Thus, M is not a Kleene-monoid. □ 

6 Open Questions 

There are several open questions that call for a treatment: Is the lower bound 
for the complexity of the word problem given in Corollary Q optimal? We did 
not consider the nonuniform word problem, i.e. the complexity of an algorithm 
that takes as input a presentation as described in Theorem^and two words and 
outputs whether these two words denote the same monoid element. Furthermore, 
we still do not know whether there exist divisibility monoids with infinitely many 
residuum functions. 
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Abstract. We introduce Concurrent Timed Automata (CTAs) where 
automata running in parallel are synchronized. We consider the sub- 
classes of CTAs obtained by admitting, or not, diagonal clock constraints 
and constant updates, and by letting, or not, sequential automata to up- 
date the same clocks. We prove that such subclasses recognize the same 
languages but differ from the point of view of succinctness. 



1 Introduction 

Since their introduction by Alur and Dill P, Timed Automata (TAs) have been 
one of the most studied models for real-time systems. TAs extend classic uj- 
Automata by introducing variables measuring time, called clocks: Transitions 
are guarded by clock constraints, which compare the value of a clock with some 
constant, and perform reset updates, which reset a clock to the initial value 0. 

Later, both diagonal clock constraints, which compare the values of two 
clocks, and constant updates, which set the value of a clock to some constant, 
have been added to TAs. These new features do not increase the class of lan- 
guages accepted by TAs (see m)- In fact, for each TA of size n and with d 
diagonal clock constraints there is an equivalent TA of size nx2‘^ without diago- 
nal clock constraints (see 0 for the proof). Moreover, for each TA of size n there 
is an equivalent TA of size polynomial w.r.to n and with only reset updates. 

Extensions of TAs to deal with parallelism have been proposed by Bornot and 
Sifakis [3E| and by Lanotte et al. [B|. In the model of |3E], an action from the 
environment is sensed either by all automata running in parallel, if the action is 
a “communication action”, or by one single automaton, otherwise. In the model 
of the environment performs more than one action at the same instant, and 
each of the automata running in parallel can sense its own subset of actions. 

In the present paper we propose a variant of TAs with parallelism, called 
Concurrent Timed Automata (CTAs), where automata running in parallel are 
perfectly synchronized, in the sense that they can compute only by sensing, at 
each instant, the same action from the environment. CTAs can be translated 
into equivalent Alur and Dill’s TAs by using the cartesian product. CTAs with- 
out clocks correspond to Drusinsky and Harel’s Concurrent Automata [Z|. Such 
Concurrent Automata can be mapped to w- Automata, and the exponential lower 
bound of this mapping has been proved in [7|- From this result, the exponential 
lower bound of mapping CTAs to Alur and Dill’s TAs follows. 
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Our model is included in those of m and of 0. We show that notwisthand- 
ing its simplicity, our model is suitable to investigate the role that some features, 
which may or may not be offered by the model, play in the parallel setting. 

First of all we analyze the power of diagonal clock constraints. We show that 
mapping CTAs with diagonal clock constraints to CTAs without diagonal clock 
constraints is always possible, and we prove both the exponential lower and 
upper bounds of such a mapping. Note that the upper bound in the sequential 
case is polynomial w.r.to the number of states and transitions and exponential 
only w.r.to the number of diagonal clock constraints (see 0). 

Then, we analyze the power of constant updates. We show that mapping 
CTAs with constant updates to CTAs with only reset updates is always possible, 
and we prove both the exponential lower and upper bounds of such a mapping. 
Note that the upper bound in the sequential case is polynomial. 

Finally, we consider CTAs with private clocks, i.e. the subset of CTAs where 
the value of a clock can be updated by only one sequential automaton (the owner 
of the clock), and can be read by all sequential automata running in parallel. 
We prove the exponential lower and upper bounds of mapping CTAs without 
diagonal clock constraints and with constant updates to CTAs with private clocks 
and with the same sets of clock constraints and updates. We prove also the 
polynomial upper bound of mapping the other subclasses of CTAs (i.e. CTAs 
with diagonal clock constraints and CTAs without diagonal clock constraints and 
with only reset updates) to the corresponding subclasses with private clocks. 

We view our results as a starting step towards the study of parallelism in 
TAs. We intend to study in the future the role of the nondeterministic updates, 
introduced in inini, in the parallel setting. We intend also comparing communi- 
cation through clocks, which is used both in our model and in that of m. with 
explicit communication (automata view the current state of other automata and 
the time elapsed since their activation), which is used in the model of |S|. 

2 Concurrent Timed Automata 

Timed words 

Given a time domain T , a timed sequence over T is an infinite non decreasing 
sequence (ti)j>i such that, for each t gT, there is some ti > t. 

Given a finite alphabet of actions S, a timed word lo = (ai,ti)i>i over S and T 
is an infinite sequence such that (ai)i>i is a sequence of actions, denoted also 
untimed{uj), and (ti)i>i is a timed sequence over T. 

Clock valuations and clock constraints 

We assume a set X of variables, called clocks. A clock valuation over A is a 
mapping v : X ^ 'T assigning time values to clocks. For a clock valuation v and 
a time value t, v + t denotes the clock valuation such that {v + t){x) = v(x) + t. 

Given a subset of clocks X C X , the most general set of clock constraints 
over X, denoted C(X), is defined by the following grammar, where (f> ranges over 
C{X), x,y G X, q gT and # € {<,<,=, yf, >, >}• 

(j) ::= x#( 7 |a: — (f)\ true 
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Constraints of the form x — y^q are called diagonal dock constraints. The subset 
of diagonal free clock constraints, denoted Cdf{X), excludes such constraints. 
We write v \= 4> when the cloek valuation v satisfies the clock constraint 4>. 
Formally, v \= xffq iS v(x) ffq, v \= x — yffq iS v(x) — v(y) ffq,v \= <f>i /\ 4>2 
V \= 4>i and v |= 4>2, v |= ~'4> iff u ^ u |= (/)i V ^2 iff w |= (/)i or u |= (j) 2 , and 
V 1= true. 

Updates 

Updates modify values of clocks. Formally, an update up over a set of clocks X is 
a finite collection (Mp*)i<i<fc, where up^ is a simple update of the form Xi := qi, 
with Xi & X and qi G T, and Xi = Xj implies qi = qj, for every 1 < i < j < k. 
The update up maps a given clock valuation v to the valuation up{v) such that: 

, , , , f Oi if X = Xi and up* is the simple update Xi := qi 

UpiVjix) = S , \ .r ^ i 

\u(a;) if X ^ {xi, . . . ,Xfc} 

For a set X C X, let U{X) denote the set of updates over X, and let Uq{X) de- 
note the set of reset updates, i.e. updates with simple updates of the form Xi := 0. 



The formalism 

A Concurrent Timed Automaton is a tuple A= (X, X, Ai, . . . , Am, R), where: 



— A is a finite alphabet of actions. 

— A is a finite subset of clocks in X. 

— Ai = (Si,s^,Si) is a sequential automaton, with states Si = {s°, . . . , 
initial state s° and set of transitions Si C Si x [C{X) x E x U{X)] x Si. 

^ Rf= Ul<i<m is the set of repeated states. 



A configuration of A is a collection of states c = , . . . , s^) & S\ x . . . x Sm- 

The initial configuration cq is the collection of the initial states (s®, . . . , s^). 
There is a step from a configuration c = (sj* , . . . , to a configuration 
c' = (sj*,...,s^) through action a, written c ==^ c', if there are transitions 

U . (j)'i ,, ^ ^ /iij I 1\ 1/ \* 

5^* — > 1 < 2 < m (aii automata read a), and [upi ^ . . . ^upm) is an up- 

date. 

A path P in A is an infinite sequence of steps cq ci C 2 .... It is accepting 
if it passes infinitely often through configurations containing some state in R. 



A run r of A through the path P is a sequence (co,wo) 



a 1 , i 1 



(ci,Ul) 



02,^2 



{C2,V2) 

/ hi I 
(Si ' ,.. 

sitions Si 



., where (ti)i>i is a time sequence, and, if Ci is the configuration 



, and the step Ci 






> Ci+i of P is formed by the set of tran- 
’ , with 1 < j < m, then it holds that: 



— vo(x) = 0 for every x € X 

— Vi + (ti+i — ti) \= for each 1 < j < m 

— Vi+i = (■up*+\ . . .,up^'^){v, + (ti+i - U)). 



The label of the run r is the timed word oj = (ai,<i), ( 02 , t 2 ) • • •• If the path P 
is accepting then the timed word to is accepted by A. The set of all timed words 
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accepted by A is denoted by £(Al), and is the (timed) language accepted by A. 
We denote with untimed(£(A)) the set {untimed(io) \ uj G £(A)}. 

The subclasses 

Given sets C C C(X) and U C U(X), the class CTA(C,U) contains the CTAs 
whose transitions are guarded by constraints in C and perform updates in U. 

The subclass of CTA(C,14) with private clocks, denoted CTApr(C,U), con- 
tains the CTAs, A = (X, X, Ai, . . . , Am, R) in CTA(C,U) where there is a parti- 
tion Xi, . . . , Xm of X s.t., if an update a; := g is in Ai, then x G Xi. Ai is called 
the owner of the clocks in Xi and is the unique automaton that updates them. 

The subclass of sequential CTA(C,U), denoted STA(C,U), contains the CTAs 
in CTA(C,U) having only one sequential component. Alur and Dill’s model corre- 
sponds to STA(Cdf(X),Uo(X)). Note that STA(C,U) and STApr(C,U) coincide. 

We shall consider the classes of FiglU We shall prove that they do not differ 
from the point of view of expressiveness, but do differ from that of succinctness. 
Given two classes of CTAs C and C , we write: 

— C ^ C' iff for each element A G C there is an equivalent (i.e. accepting the 
same language) element AI G C that is polynomial w.r.to A, and conversely; 

— C C iff for each element A G C there is an equivalent element A' G C 
that is at most exponential w.r.to A] 

— C — > C iff there is a family of languages (Tm)m>o accepted by a family 
(Am)m>o G C, with Am polynomial w.r.to m, and every family (Bm)m>o G C 
accepting (Lm)m>o is such that Bm is at least exponential w.r.to m; 

— C — ^ C' iff C -4 C' and C C, and C ^ C' iff C ^ C' and C C. 
Theorem 1. All relations of Fig^ hold. 

In Sect. 0 we prove that C ^ C for each pair of classes C, C such that C ^ C 
is in Fig0 In Sect. 0(resp. Sect. 0 ) we prove that C C (resp. C — ^ C) for 
each pair of classes C, C such that either C — ^ C or C A — ^ C is in Figd 



CTA(C(X),U(X)) ~ CTApr(C(X),U(X)) 




~ CTApr(C(X),Uo(X)) 



CTA(Cdf(X),Uo(X)) ~ CTApr(Cdf(X)Mo(X)) 



Fig. 1. Relations of succinctness 
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3 Polynomial Reductions 



Here we prove that all relations ^ in Figniiold. Without loss of generality, we 
assume that clock constraints x # g and x — j/ # g are such that # G {>,<}■ 



Proposition 1. CTA{C{X),Uo{X)) ~ CTApr{C{X) Mo{X)) . 



Proof. Since CTA{C{X),Uo{X)) D CTApr{C{X),Uo{X)), it suffices to prove 
that, for each A = {X, X, Ai, . . . , A.^, R) G CTA{C{X) ,Uq{X)) , there is some 
A' G CTApr{C{X) ,U q{X)) polynomial w.r.t. A such that C{A) = C{A'). 

We obtain A' = (X, ^ ^i, • ■ ■ , A'^, R) as follows: 

1. each clock x is replaced by xi, . . . , Xm, owned hy A[, respectively; 

2. each reset x := 0 in Ai is replaced by Xi := 0 in A'; 

3. each constraint x > q (resp.: x < q, x — y > q, x — y < q) is replaced 

by ^ VzG[l.m]^* < A*G[l.m] VjG[l,m] 2;^ ~ Vj > 9, 

ViG[l,m] AjG[l,m] ~ Vj ^ ' 



Intuitively, x is reset in A iff some x^ G {xi, . . . ,Xm} is reset in A'. Therefore, 
X > q (resp. x < g) iff the clock Xh G {xi, . . . , Xm} that has been most recently 
reset satisfies Xh>q (resp. Xh < q), i.e. AiG[i,m] > g (resp. V*G[i.m] <^)- 
Moreover, if yk G {yi,...,ym} is the clock replacing y that has been most 
recently updated, x — y > q (resp. x — y < q) iS Xh — yk ^ q (resp. Xt — yk < q), 



b®- AiG[l,m] 



Vj-G[l.m] - Vj > (resp. V,G[l.m] AjG[l.m] ~ Vj < ?) • 



We note that A! is polynomial in size w.r.t. A. We prove that C{A!) C C{A). 
The other inclusion can be proved analogously. 

Assume a timed word uj' = (ai, ti)(a 2 , ^ 2 ) ■ • ■ G C{A'). There is a run r' = 

(co,Vq) (ci,x() . . . such that cq ci ... is an accepting path. 

Let Vo = Vq. To prove that tv' G C{A), we prove that there is a run r = 



(co,vo) (ci,vi) . . . such that Vi(x) = min{v'(xi), . . .,v'(xm)}- 



0-2, t 



It suffices to prove that {ci,vf) implies {ci,Vi) 

{ci+i,Vi+i) . This follows from two facts: 

1) If v[ + ti+i — ti \= (j)' for an arbitrary constraint then Vi + — ti \= 4>, 

with (j) the constraint from which (f>' is derived. This follows from the definition 
of 4>' and the relation between v[ and Vi. 






2) If (c„x') 



(Ci+i,x'_|_i) resets xu, then {ci,Vi) 



{ci+i,Vi+i) resets 



X, and 0 = Vi+i{x) = min{vl^j^{xi), . . . ,0, . . . , w'+dxm)}- 

If (ci, v'f) resets no clock in {xi, . . . , Xm}, then it holds that 

Vi+i{x) = Vi{x) + ti+i - ti = min{vl{xi) + U+i -U,..., v[{xm) + U+i - U} = 
min{v^_^_J^{xl),...,v^_^_^{xm)}■ □ 



Let us consider the CTA A' of the proof above. Since diagonal clock con- 
straints are derived only from diagonal clock constraints that appear in A, if 
A G CTA{Cdf{X)Mo{X)) then A' G CTApr{Cdf{X)Mo{X)). Therefore, we in- 
fer the following result. 

Proposition 2. CTA{Cdf{X)Mo{X)) - CTApr{Cdf{X),Uo{X)). 
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We consider now CTAs with diagonal constraints and constant updates. 

Proposition 3. CTA{C{X),U{X)) ~ CTApr{C{X),U{X)). 

Proof. Since CTA{C{X),U{X)) A CTApr{C{X),U{X)), it suffices to prove that, 
for each A = [S, {x^ , . . . ,x^}, Ai, . . . , Am, R) G CTA{C{X),U{X)), there is 
some A' G CTApr{C{X),U{X)) polynomial w.r.t. A such that C{A) = L{A'). 

The proof is done in two steps. In the first step we construct the CTA A” = 
(27, X", A'i, . . . , A" , A" +1, . . . , A" R) G CTA{C{X),U{X)) such that: 

— X” = X\J Ufc6[i.^] Uje[i.n]{4> U 

— Each component with fc G [1, m], is obtained by adding to each transition 

t in Ak, and for each clock xA either {s-^ := 0,t-[ := 1}, if x^ is updated in 
t, or {s|, := 0,ti ■= 0}, otherwise. Note that = 1 if A'^ updates xf 

whereas tj. — $1. = 0 otherwise. 

— Each component A'f^_^_j, with j G [l,n], keeps track of the component that 

has most recently updated x^ . More precisely, s®^s clocks and so 

that, for each 1 < /c < m, — j/-' = fc — 1 iff the most recent update on x^ 
has been done by the component A'i^. We define as follows: 

• ^m+j states isi,willi, . . . ,ism,willm, with isi initial. State isk is 
entered iff A'^, is the component that has most recently updated x^ and 
^m+j predicts that x^ is not updated in the current step. State willk is 
entered iff A'f^_^j predicts that A'^ is updating x^ in the current step. 

• For each action a there are transitions 

1 . {ish, -sj = 0),a,{},ish} 

2 . {ish, -sj = 0),a,{z^ := k - l,y^ := 0 },^/;^) 

3 . (willk,tl - sj. = l,a,{},isk) 

4. {willk, - si = ^,a,{z^ := h- l,y^ := 0},willh). 

The first checks that the prediction that x^ was not updated in the past 
step is correct — Si = 0)), and predicts that x^ is not updated 

in the current step {ish is entered). The second checks that x^ was not 
updated in the past step and predicts that x^ is updated in the current 
step by A'l {willk is entered). The third checks that x^ was updated by 
A'l, in the past step = 1) and predicts that x^ is not updated in 

the current step. The last checks that x^ was updated by A'l in the past 
step and predicts that x^ is updated by A'l^ in the current step. 

In the second step we replace each clock x^ hy x\, ... , xf^, with x\ owned by A”. 
We replace each update x^ := q in A" with x{ := q. We add n components that 
ensure that if in a step two different clocks xf, xl. replacing x^ are updated, then 
they have the same value. We derive A' = [S, X', A{,. ■ ■ , RRm+ 2 n, R) follows: 

-X' = (X"\X)uU,,p.„]Ue[iH{a^^} 

— for each i G [1, m], A[ is obtained from A” by replacing each update x^ := q 
with x\ := q, and by modifying constraints of transitions as follows: 
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• >q becomes ~ = h - I ^ > q) 

• x^ <q becomes AhG[i.m](^^ ~ = h - I ^ x^ < q) 

• x^ -x^ > q becomes A?iG[i.m](^^ - yJ = h - 1 ^ AfcG[i.m](^* -y" = 
k-l^xi~xl> q)) 

• x^ -x^ < q becomes A/.G[i,m](^^ ~ = h-1 ^ AfcG[i,m](^* -y" = 

k - x{- xl < q)). 

- for each I < j < n, = A" 

— for each 1 < j < n, checks that if in the previous step two different 

clocks a;^ and a;^ were both updated, then they have the same value. It has 
a state s and transitions (s, A/ig[i = 1) A {tj, — sj, = 0)) 

^h~^k = l.aAl^s)- 

A” (resp. A!) is polynomial w.r.t. A! (resp. A) and C{A”) = C{A') = C{A). □ 

4 Exponential Upper Bounds 

Here we prove that CTA{C{X),U{X)) CTA{Cdf{X),Uo{X)), from which it 
follows that all arrows in FigQ represent exponential upper bounds. 

Proposition 4. CTA{C{X),U{X)) CTA{Cdf{X),Uo{X)). 

Proof. Given any CTA A € CTA{C{X),U{X))., we obtain an equivalent au- 
tomaton S € STA{C{X),U{X)), exponential w.r.to A, by cartesian product. 
Following PI, we transform S into an equivalent S' G STA{Cdf{X),l4{X)), with 
|5'| = |5| X 2‘^, where |5'| and |5| are the size of S' and S, and d is the number of 
diagonal constraints in S. So, S' is exponential w.r.to A. As pointed out in |n|, 
S' can be transformed into an equivalent S" G ST A{Cdf{X) ,Uo{X)) , polynomial 
w.r.to S' . So, S" is equivalent to A and exponential w.r.to A. □ 

Corollary 1. If C — > C or C i — > C is in Fian\ then C — ^ C . 

5 Exponential Lower Bounds 

In this section we prove that all arrows in FigUl represent exponential lower 
bounds. The result follows from CTApr{C{X),Uo{X)) — ^ CTA{Cdf(X),U{X)), 

CTApr{Cdf{X)M{X)) ^ CTA{C{X)Mo{X)) and CTA{Cdf{X)M{X)) ^ 
CTApr{Cdf{X),U{X)), which are proved below. 

Simulation of diagonal constraints 

We show that CTApr{C{X),Uo{X)) — ^ CTA{Cdf{X),U{X)), i.e. simulating di- 
agonal constraints implies, in general, an exponential growth of the CTA. 

Let (^Am)m>o — (({^? J LJo<i<m^^*’ ’ ^0 ; ■ • ■ ; Am , {QQ}))m>o the fam- 

ily in CTApr{C{X),Uo{X)) with Ag and fc > 1, represented in Fig0 The 
family {Am)m>o accepts the family (Tm)m>o of timed languages over {a, b} such 
that: 
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— untimed{Lm) = {h* a? * )“ 

— every timed sequence underlying a timed word in is strictly increasing. 





Fig. 2. Automata Aq and Ak, k > 1 {qo and qt are the initial states) 



The idea is that Aq, , Am are able to count 2'"+^ occurrences of the action 
a. More precisely, Ag, . . . , Am implement a counter modulo 2™+^. 

Let 6g, . . . , 6m be coefficients in {0, 1}. When X)o<fc<m ^ occurrences of a 
have been counted, the component Ak is either in state qk, if bk = 0, or in state 
q'f., if bk = 1. So, every 2’”+^ = 1 + I]o<fc<m 2* occurrences of a, Tig, ... , Am are 
in state gg, . . . , qm, respectively, and occurrences of 6 can be read. 

Whenever a is read, Ak changes its state iff Ag, . . . , Ak-\ are in state <7g, . . . , q'k-\- 
This corresponds to the fact that bk switches from 0 to 1, or conversely, every 
1 + X)o<i<fc-i 2* occurrences of a. The component Ak infers that Ag, . . . , Ak-i 
are in state gg, . . . , q'k_i, respectively, by the fact that j/g < xqA. . .Ayk-i < Xk-i. 
This is correct, since clock yi (resp. Xi) is reset every time q[ (resp. qi) is entered. 

This example shows that a sequential component can use diagonal clock 
constraints to communicate its actual state to other components. 

Proposition 5. CTA^r{C{X),Uq{X)) ^ CTA{Caf{X)M{X)). 

Proof. Let (Lm)m>o be the family of languages accepted by the family (Am)m>o 
in CTApr{C{X) ,l4o{X)) considered above. Since Am is polynomial w.r.to m, it 
suffices to prove that every family {Bm)m>o in CTA{Cdf{X),U{X)) accepting 
(Lm)m>o is such that Bm is at least exponential w.r.to m. 

Assume any family {Bm)m>o in CTA{Cdf{X),U{X)) accepting (Lm)m>o- Since 
Bm can count 2™+^ occurrences of a, it has at least 2™+^ configurations. If such 
occurrences of a are counted by a single sequential component of Bmi then such a 
component has at least 2*”+^ states. So, it suffices to prove that the occurrences 
of a cannot be counted by n > 1 components in Bmi through any cooperation. 
Assume, by contradiction, that such components i?i, . . . , exist. In this case, 
there must be some Bi that can check whether another Bj, with j ^ i, is in some 
state qj or not. The condition “Bj is in state qf^ must correspond to some non 
diagonal clock constraint, since the components of Bm can communicate only 
through clocks. So, let (fj be the diagonal free constraint representing ^^Bj is in 
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state qj” . Let us rewrite (j)j as (j)j,i V ... V 4>j^hi with each a conjunction of 
atomic constraints x^j^q. First of all we note that no constraint Xj^i = qj^i can 
appear in since conditions ^‘Bj is in state qj" and ‘‘‘’Bj is not in state qj" 
must hold for intervals and not for discrete instants. Moreover, we can prove 
that some constraint Xjj < qjj, or Xjj < qjj, appears in (pjj for each 1 < i < h. 
In fact, in the contrary case, some step must be performed to set the clocks, so 
to maintain (j)jj (and, therefore, (pj) false when qj is not active (otherwise the 
growing of clocks will make (pj true when qj is not active) . Since such a step can 
be performed only when receiving a or b, the input rate results to be constrained. 
This is a contradiction, since in Lm the input rate is free (more precisely, the 
interval of time between two subsequent actions can be arbitrarily large). So, 
we have proved that some constraint Xjj < qjj, or Xjj < qjj, appears in pjj 
for each 1 < i < h. Now, whenever qj is enabled and pj is going to become 
false (i.e., every pjj is going to become false), a step must be performed to set 
the clocks, thus maintaining pj true. Once again, such a step can be performed 
only when receiving a or 6, the input rate results to be constrained, and this is 
a contradiction, since in Lm the rate of the input is free. □ 

Simulation of constant updates 

Here we show that CTApr{Cdf{X),U{X)) — ^ CTA{C{X),Uq{X)), i.e. simulating 
constant updates implies, in general, an exponential growth of the CTA. 

Let (Mm)m>o — (({^5 5 5 ^0 ; • ■ ■ 5 {^o}))m>o the fam- 

ily in CTApr{Cdf{X),U{X)) wiFh Ag and Ak, k>\, represented in Fig0 The 
family {Am)m>o accepts the family (Lrn,)m>o of timed languages over {a, 6}, 
where contains the timed words (a, to); (^; ^o); («; ^i); (b, t'^) . . . such that: 

— to > 5 and, for each i > 0, U+i <U + b 

— for each 0 < t < 2"*+^ — 1, ~ xn) is in a set Ti. More 

precisely, if i = J2o<k<m‘^'" ^ with bg,...,bm S {0,1}, then, for each 
0 < A: < TO, either 1 - 1/2^ € T„ if bk = 0, or 1 - 1/2'=+™ S T„ if bk = 1. 



b, ok 



a,xo < 5 
.To := 5 
zo := 1 

b, ok 



So^ 
a, To < 10 
■To := 0 
Zo := 1/2" 



ok = Vo 



= 1; A, = Ao 



a, ~^Xk 

Tfc := 0 



Zfe := 1/2'= 




Ti > 5 



qk 



a, Afe 



Zfe := 1/2'= 

b 



Sly 

a, Xk 

Xk ■■= 0 
Zk := 1 / 2 '=+™ 

Xk ■■= 5 

V := 1/2'=+™ 




Fig. 3. Automata Aq and Ak, k > 1 (go and qt are the initial states) 



The idea is that Ag,. . . , A^ are able to count 2™+^ occurrences of the action 
a. More precisely, Ag, . . . , Am implement a counter modulo 2™+^. 
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Let bo, . . . ,bm be coefficients in {0, 1}. When i = J2o<k<m ^ occurrences of 
a have been counted, the component Ak is either in a state in {qk, sj,}, if bk = 0, 
or in a state in {g^,Sfc}, if = 1- So, when the subsequent occurrence of a is 
read, the clock Zk takes either 1/2^, if bk = 0, or 1/2^+™, if bk = 1. This implies 
that the time elapsed between the a and the next occurrence of b is in the set 
Ti, which contains either 1 — 1/2^, if 6^ = 0, or 1 — if bk = 1- 

Note that the sets Tq, . . . ,T 2 m+i_i are pairwise different. 

Whenever a is read, Ak changes its state iff Aq, . . . , Ak-i are in state Qq, . ■ ■ , q'k-n 
respectively. This corresponds to the fact that bk switches from 0 to 1, or con- 
versely, every 1 -|- J2o<i<k-i 2* occurrences of a. The component Ak infers that 
Aq, . . . , Ak-i are in state q'q, . . . , q'k_i by the fact that a:o > 5 A . . . A Xk-i > 5. 
This is correct, since clock Xi is set to 5 (resp. 0) every time qi (resp. g') is left, 
and two subsequent occurrences of a are separated by less that 5 units of time. 

This example shows that a sequential component can use constant updates 
to communicate its actual state to other components. 

Proposition 6. CTApr{CdfiX),U{X)) A CTA{CiX),UoiX)). 

Proof. Let (Lm)m>o be the family of languages accepted by the family {Am)m>o 
in CTApr{Cdf{X) ,U{X)) considered above. Since Am is polynomial w.r.to m, it 
suffices to prove that every family (Bm)m>o in CTA{C{X),l4o{X)) accepting 
{Lm)m>o is such that Bm is at least exponential w.r.to m. 

Assume an arbitrary Bm in CTA{C{X) ,Uo{X)) accepting Lm- It must have at 
least 2™+^ configurations, cq, ■ ■ ■ , C 2 m+i_i, such that, whenever Ci is activated, 
an occurrence of b must be read after a time chosen in Ti. First of all we observe 
that, whenever Ci is activated, to check whether a time in Ti is elapsed we can 
observe only clocks that are reset (and not arbitrarily set, since constant updates 
are not allowed) when Ci is activated. We mean that we cannot observe clocks 
that are reset before. The reason is that the step leading to Ci is caused by an 
occurrence of a that arrives nondeterministically in the interval [t, t-|-5), where t 
is the arrival time of the previous occurrence of a. As a consequence, from every 
state in Ci at least \Ti\ transitions depart, each representing waiting for a time 
in Ti. This implies that no state in Ci can be in any configuration Cj, with j ^ i. 
It follows that at least 2™+^ states are in Bm, which implies the thesis. □ 

Simulation of private clocks 

Here we show that CTA{Cdf{X),U{X)) — ^ CTApj.{Cdf{X),U{X)), i.e. simulat- 
ing non private clocks implies an exponential growth of the CTA with diagonal 
free clock constraints and constant updates. 

Let {Am)m>i = (({a}, {x}, A^,..., Am, {go}))m>i be the family in 
CTA{Cdf{X) ,U{X)) such that Ak, fc > 1, is represented in Fig^j 
The family {Am)m>i accepts the family {Lm)m>i of timed languages over {a} 
containing the timed words (a, ti)*(a, ^ 2 )* ... (a, tn)* . . ■ such that: 

— ti = 2m -|- 1 and ^2 ~ ti S {2m -1-1 — 1, 2m -|- 1 — 2, ... , 2m -I- 1 — m}; 

— ti — ti-i G Ti implies —UG {Ti \ {2m -|- 1 — A:}) U (2m -|- 1 — (A; -|- m)} or 
ti+i — ti G {Ti \ {2m -I- 1 — (A: -I- m)}) U |2m -I- 1 — A:}, for some 1 < A: < m. 
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X = 2m + 1 




X = 2m + 1 



Fig. 4. The automaton At, k > 1 {qt is the initial state) 



At instant ti = 2m + 1, there is a finite sequence of n > 0 steps from configu- 
ration (< 7 i , . . . , ( 7 m) to itself. Moreover, there is a step where an automaton Ak, for 
some 1 < fc < TO, leaves state qk, enters state q'j. and sets the clock x to value k, 
and where automata Ai, , Ak-i, . . . , Am perform the transitions leaving 

and entering states qi, . ■ ■ , < 7 fc+i, ■ • ■ , qm, respectively. Now, the next step is 

performed at instant t 2 = + {2m -I- 1 — A:). Since k can be arbitrarily chosen 

in {!,..., to}, it holds that t 2 — ti € {2m - 1-1 — 1 , 2m -I- 1 — 2 ... , 2m -I- 1 — to}. 

Now, assume that at time ti-i Ak leaves qk (resp. q{), enters q{. (resp. qk) and 
sets X to k (resp. k + to), and that Ai, . ■ ■ , Ak-i, Ak+i , . . . , Am do not change 
state. The time ti — ti-i that elapses until the next step is performed is 2 to-|- 1 — A: 
(resp. 2m -I- 1 — (A: -I- to)). Moreover, the time that elapses between ti and ti+i 
cannot be 2m + 1 — k (resp. 2m -I- 1 — (A: -I- to)), and can be 2m + 1 — {k + m) 
(resp. 2 to-|- 1 — A;). In fact, the transition from ( 7 } to qk (resp. from qk to q'f.) can 
be performed at time ti, and that from qk to g} (resp. from q{ to qk) cannot. 

Proposition 7. CTA{Cdf{X),U{X)) -4 CTApr{Cdf{X),U{X)). 

Proof. Let {Lm)m>i be the family of languages accepted by the family {Am)m>i 
in CTA{Cdf{X),U{X)) considered above. Since Am is polynomial w.r.to to, it 
suffices to prove that every family {Bm)m>i in CTApr{Cdf{X) ,U{X)) accepting 
{Lm)m>i is such that Bm is at least exponential w.r.to to. 

Assume, by contradiction, that a family {Bm)m>i in CTApr{Cdf{X),U{X)) ac- 
cepting {Lm)m>i exists which is not exponential w.r.to to. 

Since Bm accepts words (a, ti)*(a, ^ 2 )* ■ • ■ (a, An)* ■ • where U+i — ti can be in 
one of 2™ different sets, Bm has at least 2™ different configurations simulating 
configurations ci , . . . , C 2 ^ of Am ■ Therefore, Bm cannot have only one sequential 
component. So, let us assume that it has n sequential components, B\,. . . , Bn. 
Let us call the set of possible values of A^+i — U. 

Since T 2 = {2to-|-1 — 1, 2to-|-1— 2, . . . 2to-|-1— to} and, for each i > 2, Ti_|_i is either 
(Ti\{2TO-|-l — A:})U{2 to-|-1— (A:-|-to)} or (T^ \{2 to-|- 1— (A:-|-to)}) U{2to-|- 1 — A:} 
for some 1 < A: < to, it holds that from each configuration Ci it is possible to 
reach to configurations in {ci, . . . , C 2 m} through a step. Let . . . , Ci^m be the 
configurations such that Cij is reached from Cj when Aj changes state and as- 
signs Tj € {f,j + to} to X, and Ai, . . . , Aj-i, Aj+i, . . . Am do not change state. 
Assume that c, is the configuration entered, i.e. time elapsed between enter- 
ing Ci and exiting Ci is in set T„. 

Let si,...,s„ be the states in the sequential components Bi, . . . , B„, respec- 
tively, such that Ci = (si, . . . , s„). There is a total function / : {1, . . . , to} — >■ 
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such that, given any 1 < h < m, there are: 1) a state s/(/t) with 
f{h) € 2) a transition from state s/(/i) to a state assigning 

a value jh to some clock a::/(?i) of 3) a transition either from or 

from a state that is entered whenever is, guarded by a clock constraint 

^f{h) = where such a guard checks that the amount of time 2m + 1 — t /, is 
elapsed. The transition from is needed because there must be a step leaving 
Ci^h 2m + 1 — Th instants after Ci^h is entered. 

Now, given hi ^ h 2 , in general it holds that /(/ii) ^ 1(^2), otherwise there is 
a state in Bm for each configuration of Am, thus contradicting that Bm is not 
exponential w.r.to m. So, let us assume that /(/ii) ^ /(ft- 2 )- There is a step from 
(si, . . . , Sn) to a configuration where both and are active. In such a 

configuration, both a transition guarded by the clock constraint Xf{hi) = Qf{hi), 
and a transition guarded by the clock constraint Xf(h-i) = 9 /(ft. 2 ) 
formed. This implies that an occurrence of a can be read in this configuration 
2m + 1 — or 2m + 1 — units of time after it is entered. This implies 
that Tu+i \ Tu has two elements, thus contradicting that Bm accepts Lm- □ 

From Propositions 0, El and 0 it follows that all arrows in Fig0 represent 
exponential lower bounds. 

Corollary 2. If C — > C or C < — > C is in FmlTl then C — ^ C . 
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Abstract. We study the power of finite machines with infinite time to 
complete their task. To do this, we define a variant to Wojciechowski 
automata, investigate their recognition power, and compare them to in- 
finite time Turing machines. Furthermore, using this infinite time, we 
analyse the ordinals comprehensible by such machines and discover that 
one can in fact go beyond the recursive realm. We conjecture that this 
is somehow already the case with Wojciechowski automata. 



Introduction 

Finite automata on infinite sequences have been introduced by Biichi in 
to prove the decidability of the monadic second order theory of (a;,<). Biichi 
automata differ from finite automata on finite sequences only by its condition 
of acceptance of a word. A word is accepted by a Biichi automata if and only 
if the set of states, through which the automata goes an infinite number of 
times during an exeeution (there may be several executions if the automata is 
nondeterministic), contains at least a final state. 

Independently, Muller introduced in El deterministic automata on infinite 
words such that a word is accepted if and only if the set of states, through which 
the automata goes an infinite number of times during an execution (there may 
be several executions if the automata is nondeterministic), belongs to a table 
given with the automata. 

McNaughton proved in m that (deterministic) Muller automata and nonde- 
terministic Biichi automata accept the same infinite sequences and that those in- 
finite sequences can be described by w-regular expressions of the form 1 J”^q 
where ai and Pi are regular expressions of finite words and such that none of 
the Pi’s contain the 0 symbol. 

Then Biichi introduced in |0j finite automata that are able to describe 
transfinite sets of sequences. Using those automata, he proved the decidabil- 
ity of the monadic second order theory of (a, <), where a is a countable 
ordinal. It featured special transitions for limit ordinal stages such that the 
state reached at that limit stage ^ depends only on previously reached states 
{s G ^ I V/3 < ^ > /3 (p{^) = s}. He modified again the definition to use them 

to prove the decidability of the monadic second order theory of (wi, <). 

Choueka generalized in 0 Biichi’s ideas to get automata on transfinite se- 
quences of length less than w" for a given n < u. The difference with Biichi’s 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 252-|5^ 2001. 
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approach is in the behaviour at limit stages : for a limit ordinal ^ = /3 + w" 
{n < u! and /3 = 0 or /3 > w"), s belongs to if and only if there is an infinite 
number of fc < a; satisfying (p{(3 + ■ k) = s. Note that ip is only defined for 

n < uj and that i^(^) depends only on These automata recognize (as 

the Biichi automata) words that can be described by regular expressions similar 
to Biichi’s w-regular expressions. 

Wojciechowski then introduced in US] automata on arbitrary transfinite se- 
quences, similar to Biichi’s, and corresponding regular expressions (in [I tij). de- 
scribing words recognized by these automata, with a D operatou that is the 
transfinite analogue of the to operator. If we work only on words used by Choueka 
automata, Wojciechowski and Choueka automata are equivalent. Wojciechowski 
automata are thus the automata that we will use for our base study. 

Following those definitions, several studies of Bedon, Carton and Perrin |4III 
I2I31 1 2j have focused on generalization of classical finite automata theorems and 
characterizations to infinite and transfinite words. In fact, it is Perrin and Pin’s 
m that brought some years ago the study of finite automata on infinite words 
to the author’s attention and stirred his interest in the field. 

ui is only the first infinite ordinal, hence automata on arbitrary transfinite 
sequences constitute a natural generalization of automata on infinite words, read 
cj-indexed words. Studying the behaviour of automata on ordinals is also liable 
to lead us to the right concepts for automata at large, including automata on lu- 
words. Moreover, these automata have potential applications in semantics and 
in decidability problems, e.g. Biichi’s results. Muller had also introduced his 
automata on infinite words in order to study the behaviour of asynchronous cir- 
cuits. Physicists have already investigated tasks involving infinitely many steps, 
called supertasks, the first of which is done, for example, in a half second, the 
next in a quarter second, and so on, so that the entire job is completed in a 
finite amount of time. More useful supertasks, perhaps, have been proposed to 
determine the truth of an existential number-theoretic question. Physicists have 
been surprisingly able to construct general relativistic models in which the su- 
pertasks can apparently be carried out. But our interest resides in not so much 
finding what is physically possible to compute in a supertask so much as what 
is mathematically possible. 

In order to study the power of infinite time, we define a natural extension 
to Wojciechowski automata, called W^-automata. W^-automata work on a two- 
way w-tape and have pebbles that appear only after an infinite amount of steps 
after their usage. The two-way tape lets us study the recognition power of our 
automata on common objects as finite sequences on a finite alphabet, read finite 
inputs, or cj-sequences on a finite alphabet, read reals for input. This lets us com- 
pare the power of recognition of our automata to that of common abstract state 
machines, e.g. pushdown automata, counter machines, Turing machines. . . A first 
surprising result about those automata is the power obtained by infinite time 
combined with the eventual pebbles. By using the non-bounded property of in- 

^ a* is either the empty word or the letter a repeated a finite number, an infinite 
number or a transfinite number of times. 
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finite time, we manage to use potentially an infinite amount of pebbles and 
bridge directly the gap between (finite time) Turing computability and regular 
languages. 

Still another model of supertasks is infinite time Turing machines. Jeffrey 
Kidder defined infinite time Turing machines in 1989, and he and Joel Hamkins 
worked out the early theory at Berkeley. With a fresh look in 1996, a few of 
the formerly puzzling questions were answered, and a more advanced theory 
developed and appeared in |H| . Some still more puzzling questions were answered 
later on in 2000 by Philip Welch in PI to arrive now at a more mature theory. 

We begin by recalling in section 1 the definitions of Biichi and Wojciechowski 
automata and by giving some simple examples of them. Next in section 2, we de- 
fine W^-automata and investigate their power compared to pushdown automata. 
We also prove a “pumping lemma” (lemma E|) for Wojciechowski automata in 
order to differentiate their power to W^-automata’s. Our analysis of the power 
of W^-automata leads in section 3 to the comparison with infinite time Turing 
machines (theorem 0 and EJ. We highlight, in section 4, the ordinal oriented 
aspects of our study. We show in theorem El that the ordinals comprehensible 
by W^-automata go much farther than what is usually thought and conjecture 
about ordinals comprehensible by regular Wojciechowski automata. 

1 Infinite Words and Finite Automata 

1.1 Biichi and Wojciechowski Automata 

Definition 1. A Biichi automaton is a quintuplet A = {Q, Z, F) where 
{Q,E,5) is a finite automaton, and Z and F are subsets of Q called respectiuely 
set of initial states and set affinal states. 

A finite path of A is successful if its origin is in Z and its end is in F. An 
infinite path is successful if its origin is in Z and it goes infinitely often through 
states of F. The set of finite (respectively infinite) words recognized by A is the 
set of labels of successful finite (respectively infinite) paths of A. A set of words 
M is recognizable if there is an automata A that recognizes precisely the words 
of M. 

The set of words described by {a, 5}*a“ is not recognizable by a determin- 
istic Biichi automaton. However it is recognizable by a nondeterministic Biichi 
automaton. Nondeterministic Biichi automaton are thus more powerful than de- 
terministic ones. Muller automata give a more satisfying definition from that 
point of view. 

Let us now consider words u of a finite alphabet E as a-sequences (a is 
an ordinal) on E. We now define continuous a-sequences in order to define 
Wojciechowski automata. 

Definition 2. An a-sequence (p on Q is continuous if for any successor ordinal 
(3, y>(/3) S Q and if for any limit ordinal (3, 



V^(/3) = {? G <3 I {7 < /3 I V?(7) = 9} is cofinal to (3} 
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Remark 1. A continuous a-sequence is completely determined by its values at 
successor ordinals {</?(/3) | /? G Suec,fi < a}. 



Definition 3. A Wojciechowski automaton, or W-automaton, is a quintuplet 
A = (Q, S, SAt P) with U a finite alphabet, Q the finite set of its states, 6 C 
p{Q) X E X Q the transition relation, i G Q the initial state, and F C p{Q) the 
finite set of its final states. 

From now on, by state we mean an element of p{Q) = P{Q) U Q. 

Definition 4. A run with label u G S'^ , a G Ord, of an yV-automaton A = 
{Q, S, 5, i, F) is a eontinuous {a + l)-sequence p such that </j(0) = i and for all 
/3 < a, 

{p{P),u{P),p{(3 + 1)) e (5 
A run is successful if p{a) € F. 

A word u G is recognized by A if and only if there is a successful run of A 
with label u. A set of words M is recognizable by A if this automaton recognizes 
only the words belonging to M. 

One can easily build (see 0) W-automata recognizing respectively , a^, 
{a“ I a G Succ}, {a“ | a G Lim} and 

Here is an exampl^^ of an automaton recognizing a“ for a < uji but not 
recognizing . 

Example 1.1 



a 

n 




u 



Fig. 1. A W-automaton recognizing a“ for a < u>i 

^ A square state is a state belonging to P{Q). A final state is with an inner (smaller) 
circle. 

® Why? See IS]. This example prompts questions about recognizable ordinals: for what 
ordinal a, is there an W-automaton recognizing the singleton a“? for what ordinal 
a, is there an W-automaton recognizing a word of “length” a? For some time, it was 
commonly thought that we would not go further than eo (not to talk about 
We come back to those questions at the end of the paper. 
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2 Infinite Time Finite Automata on Reals 

2.1 Infinite Time Behaviour 

We propose the following variant of Wojciechowski automata. It will hopefully 
turn out to help us in understanding Wojciechowski automata and infinite time 
machines at large. 

A 2-way Wojciechowski automaton, or W^-automaton is a W-automaton on 
a 2-way countable tape. The special behaviour of the automaton compared to 
normal Wojciechowski behaviour is at limit stages. At a limit stage, the head of 
the tape is plucked from wherever it might have been racing towards, and placed 
on top of the first cell. 

If we have a finite input, there are several ways of delimiting the input. 
We can have special end markers h and H that indicate the boundaries of the 
input. We will opt for a letter b representing “blank” cells (we are working with 
“infinite time” . . . surely, we cannot restrict ourselves to a finite accessible part 
of the tape). 

An W^-automaton also has its tape changed at limit stages in the following 
way : if at a previous successor stage the head of the tape stayed (stopped) on 
a cell for at least one stage, this cell has now the value t] if it didn’t before and 
is erased to b if it already contained the value t] (the value of the cell is toggled 
between t] and b). 

Definition 5. A W^-automaton is a quintuplet A = {Q, S,S,hP) with: E a 
finite alphabet, Q the finite set of its states, S C p{Q) x AU{b, \\}xQx {^,4,, — >■} 
the transition relation, i G Q the initial state, and F C p{Q) the finite set of its 
final states. 

A run with label a £ {SU {b, (a is a function of the form a ^ {uj ^ 

(A U {b, l]}))4, a£ Ord, of an -automaton A = (Q, E,S,i, F) is a continuous 
{a l)-sequence {pq,Ppos) on p{Q) x N such that Pq{0) = i, Pposi^) = 0 and 
cr(0) £ A“ is the original content of the tape and for any ordinal f) < a, 

{pq{P),a{/3){(ppos{/3)),Pq{l3 -k l),p{ippos{P + 1) ~ Vpos{fi))) £ S (2.1) 
where p{l) =— >■, p(0) =J, and p(— 1) =t— , 

a is the evolution of the tape’s content : <j{(3) is the tape’s content at stage (3. 
(fiq and (fipos give respectively the state of the machine and position of its tape’s 
head at every stage. (ED says that from one stage to the next one, the machine 
obeys its transition relation S. 

and if P is a successor ordinal, cr(/3) = cr(/3 — 1), otherwise (fi is a limit 
ordinal), Vn £ io, 

(a\( \ _ / */ 3/3- < 7 < /3 such that Pposil + 1) = Ppos{l) = n, 

o'lPJWJ (cr(/3_)(n) otherwise, 

and ippos(P) = 0, where /3_ is the greatest limit ordinal < and for all a £ 
E U {b, t]}, r(o) = t] if a \\, b otherwise. 
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a only changes at limit ordinal stages, where the tape’s head goes back to 
the origin and the cells where the head stopped before this limit stage (and after 
the previous limit stage) alternate between t] and b : if the value belonged to 
S U {b}, it is now if it was t], it is now b. If we are at an hyper-limit (limit of 
limits) ordinal stage, the value of the cell is the converging value if there is one, 
otherwise. 

A run is successful if ^q{a) € F. A word u G is recognized by A if and 
only if there is a successful run of A with u as the original tape content. A set 
of words M is recognizable by A if this automaton recognizes only the words 
belonging to M. 

The reader is invited to guess what language the following W^-automaton 
recognizes. 



Example 2.2 




Fig. 2. A W^-automaton recognizing . . . (see proposition!^ 



We will work with an alphabet of only two letters, say {0, 1} or {a, b}, beyond 
the other technical letters as b and 

In order to justify our definition of W^-automata, we prov^ the following 
simple lemmae. 

Lemma 1. Every 2-way finite automaton ean be transformed in a never stop- 
ping 2-way finite automaton with the same language of reeognized words. 



Lemma 2. The following two variants of -automata recognize the same lan- 
guages (of finite or infinite words) than -automata : 

— put to at limit stages, only those cells where the head has stopped previously 
infinitely often; 



For the proof, see 0. Due to space limitations, this is applicable whenever the proof 
is omitted. 
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— replac^ the “change the stopped cells at limit” behaviour by having pebbles 
who are used in the same way than in traditional pebble automata with the 
difference that the pebbles are actually placed only at limit stages on the 
requested cells, i.e. where the automaton previously asked to place them. So, 
the pebbles become only “visible” then. 



Proposition 1. = {a"6" | n G N} recognizable by -automata. 

Proof. We describe the behaviour of the required automata : 

1. first, we need to verify that the tape is composed by a’s followed by b’s; 

2. we repeat the following two points at either the state following the previous 
point or at the limit state {gj} : 

a) check that there are still b’s at the end of the tape, 

b) starting from the first cell, we look for the first a on the tape (after the 
potential l]’s) and loop (stopping there) on a special state gp 

c) from the {g|} state, we look for the first b (after the potential t] and a’s) 
and loop (stopping there) on a special state gj , 

3. if there are no more b’s at the end of the tape and only natural’s on the 
tape, we enter the “accept” state. 

The described automata clearly recognizes only the words belonging to Leg. 

The reader should have recognized the behaviour of the automaton of exam- 
ple 2.2 and thus guessed the language recognized. 

Theorem 2. Algebraic languages C languages recognized by -automata. 

Proposition 2. The languages Lpai = {s G {a, b}* \ s = s-^}, Legi/s = {a"6"a” 

I n G N}, Leq^.rn = {a”6™a"6’" | n,m G N}, L^ieq = {ss | s G {a, 6}*} are 
recognized by -automata even if some are not algebraic. 

Proposition|3shows that W^-automata recognize strictly more than algebraic 
languages using the infinite time at its disposal. We will see in the next section 
where the limit is. 

Do W^-automata gain something compared to W-automata? To see this, we 
prove the following “pumping lemma” for W-automata. 

® The transition relation now has two more fields (x{o, •} x {yes, no}) : the first one 
looks for the presence (•) or absence (o) of a pebble on the cell and the second 
commands the dropping of the (not yet visible) pebble. A run with label cr and 
pebbles labels OpehUes G Odrop G {yes, no}““ is then now a continuous 

{a -\- l)-sequence {ipq, gipos) such that apebbies{^)(n) = o for all n and for any ordinal 

f) < a, (ipq{0),<T{0){iPpoa{P)),Vq{P-\-i),p{Vpoa{0 + i)-Vpoa(P)),aj,ebbles{S){rpos{0)),a,l„p{0){q>po,{P)))& 

6 Now, only the pebble monitor {a pebbles) changes; the tape’s content does not change 
anymore. At limit stages, Cpebbies reflects the pebbles (virtually) dropped (udrops) at 
earlier successor stages . 
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Lemma 3. Let A = {Qa, A!, Sa, Fa) be an W- automaton. For eaeh reaehable 
state q € P{Qa) (there is a run {p of A going through q), there is a state q' € Qa 
sueh that q' € q and a sub-run out of ip going from q' to itself without going 
through a state of P{Qa) (^VA' ^ Ord Lpfq) = g}{rj') = q' et \/rj < f < rj' (p{f,) ^ 
P{Qa))- 

Corollary 1. W^-automata are strictly more powerful (recognition-wise) than 
W -automata. 

Proof. The argument is exactly the usual one, proving that Leg is not regular, 
with our pumping lemma Olinstead of the usual finite automata pumping lemma. 

3 Compared to Infinite Time Turing Machines. . . 

3.1 Infinite Time Turing Machines 

Hamkins and Lewis |B| defined infinite time Turing machines. They simply ex- 
tend the Turing machine concept into transfinite ordinal time. 

Definition 6. An infinite time Turing machine is a Turing machine with three 
separate tapes, one for input, one for scratch work, and one for output. The 
scratch and output tape are filled with zeros at the beginning of any computation. 
At non-limit stages, it behaves like a normal Turing machine according to its 
transition relation. At limit stages, the head is plucked from wherever it might 
have been racing towards, and placed on top of the first cell. Moreover, it is placed 
in a special distinguished limit state. For a given cell of the tape, at a limit stage 
it takes the value of the Urn sup of the cell’s values before the limit. 

Hamkins and Lewis studied the theory of such infinite time machines and 
obtained a nice theory. The power of these machines lies between I!) and A^. The 
finite time Turing machine halting problem becomes decidable. They generalize 
the classical s-m-n and recursion theorems and study the basic features of the 
structure of infinite time degrees. 

3.2 Where Is the Limit? 

Theorem 3. -automata recognize any (finite time) decidable language. 

thus really have more recognition power than finite automata or even 
pushdown automata. 

Corollary 2. All the languages in theorem\^ and proposition\^ are recognizable 
by yV^ -automata. 

We now know that sets recognizable (which is the same thing as writable 
sets from scratch) with finite time Turing machines are recognizable by W^- 
automata. The W^-automata used so far are with a N-tape. Having a Z-tape 
bridges the gap to infinite time Turing machines. Of course, W^-automata have 
the same power of recognition as W|-automata. It suffices to encode the right 
part of the Z-tape on even positions of the N-tape and folding the left part on 
the odd positions. 
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Theorem 4. -automata recognize any infinite time decidable language. 

Proof. Having a Z-tape, we can encode the two counters at the left of the origin 
{first celt) of the tape and use the right part of the tape as an N-tape for the 
one-tape Turing simulation. 

Using Wojciechowski-type limit states, we can easily simulate the one-limit- 
state behaviour of infinite time Turing machines. In the simulation of the Turing 
machine by the two-counter automaton, when the transition says to change from 

0 to 1 or vice versa we need to go through a special state qum sup to ensure that 
if it changes infinitely often, we will able to notice it and simulate a value of 1 
(the lim sup of 0 and 1). 

Remark 5. The simulation of the “one-limit-state behaviour” in the proof of 
theorem 0 takes at most up' steps. By doing a crude job, we can make it take 
exactly uP steps. 

Theorem 6. -automata recognizable languages are infinite time decidable. 

Proof. The only difficulty is in simulating the Wojciechowski-type limit states. 

We take some place on the work tape to put flags, initialized to 0, for each 
state of the simulated automaton. When we simulate the automaton and are in 
state qnow > we put the flag corresponding to qnow to 1 and put to zero the other 
flags corresponding to states q p qnow Hence, at a limit stage our infinite Turing 
machine will be in its limit state and by looking at the flags on the scratch tape, 
we will be able to see what are the states through which our machine has gone 
infinitely often (whose flags have toggled infinitely and so are now equal to the 
lim sup 1) and thus what Wojciechowski-type limit state we should simulate. 

4 Machine Comprehensible Ordinals 

We have already seen in the examples of section H that we can construct W- 
automata to recognize only the word a“ for certain ordinals a like to and . For 
certain ordinals A like loi, we can construct W-automata recognizing all a“ for 
a < A but not . 

For machines with a countable tape, we can code countable ordinals on the 
tape by the encoding of its well ordering G in a real : a well order R is encoded 
in r G K. (r is a sequence of 0 and I’s) by 

for all n,m G N, nRm if and only if ri^^.m) = 1 

where (., .) is a canonical pairing function. Hamkins and Lewis have shown that 
many ordinals are writable (in the above way) from scratch (blank input tape) 
by infinite Turing machines. 

Theorem 7. Infinite time Turing machines can write the ordinal a for any 

01 < Xinf where \mf is a recursively inaccessible countable ordinal. Hence, the 
writable ordinals cover easily the recursive ordinal^, ujp and much more. 

® A countable ordinal is recursive if encodable, as described above, by a Turing com- 
putable real. u)p (ck is for Church-Kleene) denotes the smallest non recursive ordinal. 
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Proof. See [H|- 



Corollary 3. -automata can recognize the ordinal a for any a < Xinf, hence 
all the recursive ordinals, and much more. 

Proof. By theorem 0 



Definition 7. An ordinal a is dockable by a class of machines C if there is 
fo S CdoTiJH and a machine belonging to C that halts in the “accept” state on 
input rg in a many steps. 

An ordinal a is all-til dockable by a class of machines C if a is not dockable 
and there is a machine M belonging to C such that for all (3 < a, there is 
rp G Cdom such that M halts in the “accept” state on input rp in j3 many steps. 

For W-automata, recognizable and dockable ordinals are the same because 
of the one-way nature of the tape’s head. As we saw at the beginning of this 
section, to and are dockable by W-automata and is all-til dockable by 
W-automata. 

Theorem 8. Recursive ordinals, wf*’ -I- lo and many ordinals less than an un- 
reachable ordinal 7 i„/ are dockable by infinite time Turing machines with many 
gaps in between. In fact, no admissible ordinal is cloekable. 

Proof. See (S). 



Corollary 4. For every dockable (by infinite time Turing machines) ordinal 
a < 7 m/) there is a countable ordinal (3 > a such that fi is dockable by W^- 
automata. 

Welch has shown the following theorem. 

Theorem 9. 7 ™/ is in fact Xinf and so is recursively inaccessible. 



Corollary 5. the supremum of dockable ordinals by -automata, is 

equal to Xinf, the supremum of writable ordinals by infinite time Turing ma- 
chines, and is recursively^ inaccessible. 

Proof. Use corollary 0 and Welch’s theorem 0 



Lemma 4. Every ordinal up to is dockable by -automata. 

Proof. If a is dockable by W^-automata, then so are a -I- 1 and a -\- uj. 

We can go much farther for ordinals dockable by W^-automata : 

^ Cdom is the set of inputs accepted by machines of class C. Of course, for C = W^- 
automata, Cdom = R; and 'W— automata dom is transfinite sequences on E. 
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Theorem 10. Recursive ordinal^ are dockable by -automata. 



Theorem 11. The ordinal -\- uP is dockable by -automata. 

See j0| for the proofs of theorems El and EH 

Theorem 12. If an ordinal a is dockable by infinite time Turing machines, ioa 
is dockable by -automata. 

Proof. Remark El tells us that for a task that will take w steps (one limit stage 
behaviour) on an infinite time Turing machine, we can crudely simulate it in 
steps with an W^-automaton. 



All-til dockable ordinals are more of a mystery than dockable ordinals. Rec- 
ognizable ordinals are not comparable inclusion-wise with all-til dockable ordi- 
nals, even if both families of ordinals contain the dockable ones. No admissible 
ordinal is dockable (see |S|) even if some are writable (or recognizable). Even 
is an all-til dockable ordinal by W-automata! 

The picture for comprehensible ordinals by infinite time machines is the fol- 
lowing. recognizable ordinals 



dockable ordinals 



\ 



all — til dockable ordinals 

Using theorem cni one can construct an W^-automaton proving that 
is all-til dockable by such automata. All-til clockability is clearly an intriguing 
property showing the extent of the power of the underlying machines. There 
is some reasons to think that many ordinals are even all-til dockable by W- 
automata. 

We conjecture the following. 

Enthusiastic Conjecture. Man^ propei0 all-til dockable ordinals by W^- 
automata are all-til dockable by W-automata. 

Concluding Remarks 

The remarkable property of W^ is that by adding very few power (or features), 
compared to W-automata on a two-way tape, we obtain directly Turing power. 
Moreover, when using the pebble variant (see lemma of definition to be 
able to get (finite time) Turing recognizability power, we only need to use very 
few pebbles and more surprisingly, infinitely rarely. 

In section E] we see that there are many more (and greater) ordinals compre- 
hensible by finite state machines than usually thought. We conjecture that this 
is also the case with original W-automata. 

® See the note for theorem [3 
® At least some other than u>i. 

Non-clockable but still all-til dockable. 
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Less raving questions are the following : 

Is wf* all-til dockable by W-automata? Is there some all-til dockable (by 

W-automata) admissible ordinal? 

Acknowledgements. I would like to thank Jacques Mazoyer for his remarks 

on this paper, his suggestions and his boundless enthusiasm. I am especially 

indebted to Jean-Eric Pin who started me off on this topic years ago. 

References 

1. N. Bedon, Finite automata and ordinals, Theoretical Computer Science 156 (1996), 
119-144. 

2. , Automata, semigroups and recognizability of words on ordinals, Interna- 

tional Journal of Algebra and Computation 8 (1998), 1-21. 

3. , Langages reconnaissables de mots indexes par des ordinaux, Ph.D. thesis, 

Universite de Marne-la-Vallee, 1998. 

4. N. Bedon and O. Carton, An Eilenberg theorem for words on countable ordinals, 
Latin’98: Theoretical Informatics (Claudio L. Lucchesi and Arnaldo V. Moura, 
eds.), Lect. Notes in Comput. Sci., vol. 1380, Springer- Verlag, 1998, pp. 53-64. 

5. J. R. Biichi, On a decision method in the restricted second-order arithmetie. Logic, 
Methodology, and Philosophy of Science: Proc. 1960 Intern. Congr., Stanford Uni- 
versity Press, 1962, pp. 1-11. 

6. , Decision methods in the theory of ordinals. Bulletin of the American Math- 

ematical Society 71 (1965), 767-770. 

7. Y. Choueka, Finite automata, definable sets and regular expressions over w" -tapes. 
Journal of Computer and System Sciences 17 (1978), 81-97. 

8. J. D. Hamkins and A. Lewis, Infinite time Turing machines. Journal of Symbolic 
Logic 65 (2000), no. 2, 567-604. 

9. G. Lafitte, How powerful are infinite time machine?, RR 2001-16, LIP, Ecole Nor- 
male Superieure de Lyon, March 2001, ftp://ftp.ens-lyon.fr/pub/LIP/Rapports 
/RR/RR2001/. 

10. R. McNaughton, Testing and generating infinite sequences by a finite automaton. 
Information and Control 9 (1966), 521-530. 

11. D. E. Muller, Infinite sequences and finite machines. Proceedings of the Fourth 
Annual Symposium on Switching Circuit Theory and Logical Design (Chicago, 
Illinois), IEEE, October 1963, pp. 3-16. 

12. D. Perrin, Recent results on automata and infinite words. Mathematical founda- 
tions of computer science (Berlin) (M. P. Chytil and V. Koubek, eds.). Lecture 
Notes in Computer Science, vol. 176, Springer- Verlag, 1984, pp. 134-148. 

13. D. Perrin and J.-E. Pin, Mots infinis, to appear, 1997. 

14. P. Welch, The length of infinite time Turing machine computations. Bulletin of the 
London Mathematical Society 32 (2000), no. 2, 129-136. 

15. J. Wojciechowski, Classes of transfinite sequences accepted by finite automata, An- 
nales Societatis Mathematicae Polonae, Fundamenta Informaticae VII (1984), no. 2, 
191-223. 

16. , Finite automata on transfinite sequences and regular expressions, Annales 

Societatis Mathematic* Polon*, Fundamenta Informaticae VIII (1985), no. 3-4, 
379-396. 




Equivalence Problem of Composite Class 

Diagrams 



Qirts Linde 
University of Latvia, 

Raina bulvaris 29, LV-1459, Riga, Latvia 
glindeOacm . org 



Abstract. Multiplicity constraints in a UML composite class diagram 
may be inconsistent. An algorithm is given for eliminating all such in- 
consistencies. Using this algorithm an algorithm is constructed which 
for two given composite class diagrams solves the equivalence problem. 
These algorithms can be embedded in CASE tools for automated detec- 
tion of multiplicity inconsistencies. 



1 Introduction 



As object modeling gains popularity, starting from the pioneering work HI , a 
need emerges for methods of handling and evaluating models. Object models 
are used in various stages of system development - from requirements definition 
to system design. One problem is to validate the model as it gets transformed and 
refined in the lifecycle of system development. Research is already being done 
on this topic |2ldl4l5| . A more general problem is to compare several alternative 
object models of the same ’’real world”. 

The ultimate goal of our current research is to formalize this ’’intuitive” 
equivalence for object models represented with UML class diagrams. Consider 
the class diagrams modeling directed graphs shown on Figure E 

Formally, we have here 3 totally different class diagrams. Each of them de- 
scribes different instance diagrams. But intuitively we know that all these 3 
diagrams describe the same ’’real world” - directed graphs. 

Formalization of this ’’intuitive” equivalence is based on composite classes 
and before we can proceed, certain problems have to be solved. The equivalence 
of composite classes is one of the problems, and it gives immediate applicable 
results. The results of the research on the equivalence of composites are presented 
in this paper. 



2 Composites in UML Class Diagrams 

Composition is a form of aggregation with strong ownership and coincident life- 
time of part with the whole, as described in pm. The parts of a composition may 
include classes and associations. The meaning of an association in a composition 
is that any tuple of objects connected by a single link must all belong to the 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 264- 17731 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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Fig. 1. Three class diagrams of a directed graph 



same container object. One of the ways of showing composition is by graphical 
nesting of the symbols of the elements for the parts within the symbol of the 
element for the whole. A nested class element may have a multiplicity within its 
composite element. The multiplicity is shown in the upper right corner of the 
symbol for the part. An example of a composite class ’’Cat” and its instance are 
shown in Figure El 




Fig. 2. A composite class and its instance 



In this paper we consider only a simplified class diagram (only binary asso- 
ciations and only four most popular multiplicity intervals: 1..1, 0..1, 1..*, 0..*) 
that represents the contents of one composite class. Such diagram we will call a 
composite class diagram. 

Actually, any class diagram can be regarded as the content of the composite 
class ’’World” where every class has multiplicity 0..* if no other multiplicity is 
specified. 
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3 Basic Definitions and the Main Theorem 

Definition 1. A composite class diagram (CCD) is a class diagram consisting 
of classes and binary associations, where every class has a multiplicity and only 
four multiplicities for classes and associations are allowed: 1..1, 0..1, 1..*, 0..*, 

Definition 2. Two CCDs are identical if they are equal as labeled graphs (where 
graph labels are the names and multiplicities of classes and associations). 



Definition 3. An instance of a CCD X is an instance diagram that satisfies all 
the multiplicity constraints of classes and associations of X. 

Definition 4. Two instance diagrams are identical if they are equal as labeled 
graphs (where graph labels are the class names of objects and association names 
of links). 

Definition 5. Two CCDs X and Y are equivalent if their corresponding sets 
of instances are identical (i.e. for every instance i of X there exists an instance 
i' ofY such that i and i’ are identical, and for every instance i of Y there exists 
an instance i' of X such that i and i’ are identical). 

As it turns out, the equivalence problem is not trivial. Figure 0 shows three 
composite class diagrams that look quite different, but it can be proved that 
they describe the same set of instance diagrams, so they are equivalent by our 
definition. 




(a) 



(b) 




( 0 ) 



Fig. 3. Equivalent composites 
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The diagram (c) in FigureElhas the strongest multiplicities and (incidentally) 
it is the most precise of all possible composite class diagrams that describe the 
same set of instance diagrams. We will introduce a special term - a canonical 
composite class diagram - to denote diagrams with this property. 

The main result of this paper is the following Theorem. 

Theorem 1. The equivalence problem of CCDs is algorithmically decidable. 

4 Reduction Rules 

The algorithm for solving the equivalence problem of CCDs is based on the 
reduction of inconsistent multiplicities within those CCDs. Before we present 
the reduction rules, we will introduce notation for working with multiplicities of 
classes and associations. 

Consider the following fragment of a CCD - two classes A and B, connected 
with an association R (Figure^. 





Fig. 4. A fragment of a composite class diagram 



The fact, that the association R connects the classes A and B, we will denote 
as <A — R— B>. Multiplicities of the classes and the association ends we will 
denote as card{A), card{B), card(R.A) and card(R.B), respectively. 

Now we will introduce the multiplicity reduction rules for elimination of mul- 
tiplicity inconsistencies in CCDs. There are 6 multiplicity reduction rules. 

Rule 1. If a CCD contains a cycle of associations (Figure^ < Ai~ Ri — A 2 — 
R 2 — ... — An >, where Ai = An and for every i = l..n — 1 : card(Ri.Ai) = 
O..lorl..l and card(Ri.Ai^i) = l..lorl..*, then for every i = l..n — 1 we 
replace the multiplicities of Ri.Ai and with 1..1. 

Rule 2. If a CCD contains a fragment < A — R — B >, where 

card{A) = 0..1orI..I and card(R.A) = I.. I and card(R.B) = O..IorI..I and 
card{B) = a.. * {a = Oorl), then we replace the multiplicity of the class B 
with a..l (further in the text we will use the assignment symbol for 

denotation of a replacement - in this case it would be card{B) := a..l). 
Rule 3. If a CCD contains a fragment < A — R — B >, where 

card{A) = l..lorI..* and card(R.B) = I..IorI..=t= and card{B) = 0..a(a = 
lor*), then card{B) := l..a. 

Rule 4. If a CCD contains a fragment < A — R — B >, where 

card{B) = O..lorl..l and card(R.B) = a.. * (a = 0..I), then card(R.B) := 

a.. I. 
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Fig. 5. A cycle of associations 



Rule 5. If a CCD contains a fragment < A — R — B >, where 

card{A) = 1..1 and card{R.A) = 1..1 and card{B) = and 

card(R.B) — 0..a(a = lor*), then card(R.B) = l..a. 

Rule 6. If a CCD contains a fragment < A—R—B > and a chain of associations 
< Cl — Ri — C 2 — R 2 — ■■■ — Cn > (Figure ED, where C\ = A and Cn = B 
and card{A) = 0..1 and card{R.A) = 1..1 and 
for every i = l..n — 1 : card(Ri.Ci^i) = l..lorl..* and 
card(R.B) = 0..a{a = lor*), 
then card(R.B) := l..a. 




1..* 



Fig. 6. Conditions for the reduction rule 6 



It is easy to prove the following Lemma. 



Lemma 1. The multiplicity reduction rules retain the equivalence of CCDs. 
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Definition 6. A CCD is canonical if it is not possible to apply any of the mul- 
tiplicity reduction rules to it. 



Definition 7. The canonical form of a CCD X is a canonical CCD, that is 
obtained from X using the multiplicity reduction rules. 

From Lemma [D follows that two CCDs are equivalent iff their canonial forms 
are equivalent. So Theorem Q will be proved if we will prove Theorem El 

Theorem 2. Two canonical CCDs are equivalent iff they are identical. 

The proof will be presented in the last section. In the following sections we 
will introduce some Lemmas needed for the proof. 



5 Properties of CCDs 

In this section some easily provable properties of CCDs are formulated. 

Lemma 2. For every CCD X there exists an instance containing exactly one 
object from every class and one link from every association of X. 



Lemma 3. If a CCD X contains a fragment < A — R — B >, where 
card(R.A) = card(R.B) — 1..1, then every instance of X contains an equal 
number of objects of classes A and B. 



Lemma 4. If a CCD X contains a fragment < A — R — B >, where 
card(R.A) = O..lorl..l and card(R.B) = l..lorl..*, then for every instance d of 
X holds the inequality a < b, where a - the number of objects of the class A in 
d, and b - the number of objects of the class B in d. 



Lemma 5. If a CCD X contains a fragment < A — R — B >, where 
card(R.A) = 0.. * orl..* or card(R.B) = 0..1or0..*, then for every two integers 
a and b that satisfy the inequality a > b > 1, there exists an instance of X, in 
which the number of objects of class A equals a and the number of objects of 
class B equals b. 



Lemma 6. Let CCD X contain a fragment < A — R—B >. Let d be an instance 
of X . Let d' be another instance diagram, obtained from d by changing only the 
links of the association R. If the resulting configuration of R links in d' satisfies 
the multiplicity constraints of the association R, then d' is an instance of X. 
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6 Properties of Canonical CCDs 

In this section some properties of canonical CCDs are formulated. 

Lemma 7. For every canonical CCD there exists an instance eontaining exactly 
1 object from every class with multiplicity 1..1 or 1..* and no objects from classes 
with multiplicity 0..1 or 0..*. 

Lemma 8. For every canonical CCD there exists an instance containing exactly 

1 object from every class with multiplicity 0..1 or 1..1 and 2 objects from every 
class with multiplicity 1..* or 0..*. 

Lemma 9. If a canonical CCD X contains a fragment < A — R — B >, where 
card{A) = 0..1 and card(R.A) = 1..1 and card(R.B) = 0..1or0..*, then there 
exists an instance of X, eontaining exactly 1 object of the class A and no objects 
of the class B. 

Lemma 10 . If a canonical CCD X contains a fragment < A — R — B >, where 
card(R.B) = 0.. * orl..*, then card{B) = 0.. * orl..*. 

Lemma 11 . If a canonical CCD X contains a fragment < A — R — B >, where 
card{A) = 1..1 and card(R.A) = 1..1 and card(R.B) = 0..1or0..*, then 
card{B) = 0..1or0..*. 

Lemma 12 . If a eanonical CCD X contains a fragment < A — R — B >, where 
card(R.A) = O.Aorl.A and card(R.B) = and card{B) = 1.. * orO..*, 

then there exists an instance of X, eontaining exactly 1 object of the class A and 

2 objects of the class B. 

Proof of these Lemmas is based on Lemmas |30and|ni 

7 Proof of Theorem [21 

It is obvious that two identical CCDs are equivalent. So, we need to prove that 
two canonical CCDs that are not identical are not equivalent - i.e., there exists 
an instance diagram that satisfies one of the CCDs but not the other one. 

If two CCDs are not identical, then there can be 3 cases: one CCD contains 
a class or an association that is not present in the other CCD, multiplicities of 
some class differ, or multiplicities of some association end differ. Let’s examine 
each of these cases. 

Case 1. One CCD X contains a class or an association A that is not present in 
the other CCD Y . 

From Lemma 121 there exists an instance d of X that contains one object or 
link of A. Obviously this instance d doesn’t satisfy the CCD Y . So X and Y are 
not equivalent. 
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Case 2. Multiplicities of some class A differ. There are two subcases: 

1. In one CCD (X) card{A) = 0..1or0..*, but in the other CCD (Y) card{A) = 

Then from Lemma 0 follows that there exists an instance d of 
CCD X that contains no objects of A. This instance d doesn’t satisfy the 
CCD Y. So X and Y are not equivalent. 

2. In one CCD (X) card{A) = O..*orl..*, but in the other CCD (T) card{A) = 
O..IorI..I. Then from Lemma0follows that there exists an instance d of CCD 
X that contains 2 objects of A. This instance d doesn’t satisfy the CCD Y. 
So X and Y are not equivalent. 

So, if multiplicities of some class A differ, then CCDs are not equivalent. 



Case 3. Multiplicities of some association end differ. 

First we define an enumeration N for the four used multiplicities as follows: 
1: 0..*, 2: 0..1, 3: 1..*, 4: 1..1. 

Consider two canonical CCDs X and Y that both contain a fragment < 
A — R—B >, and in the enumeration N card(R.B) in X comes before card(R.B) 
in y. To prove that these CCDs are not equivalent, we will find an instance of 
X that doesn’t satisfy card(R.B) in Y. A set of instances of X that contains 
such an instance we will call a set of representative instances for the association 
end R.B. We will prove that such a set exists for all possible multiplicities of A, 
R.A, R.B and B. 

To help to find sets of representative instances we define a set of representa- 
tive fragments of instance diagrams for every multiplicity of R.B (according to 
the enumeration N) containing one or two instance diagram fragments. Figure 0 
shows all the eight sets of representative fragments. For each of the multiplicities 
of R.B there are two or three subcases, depending on the multiplicities of A and 
R.A. For example, let R.B have the multiplicity 1: 0..*. If card{A) = 0.. * orl..* 
then the corresponding set of representative fragments is I. a, consisting of just 
one fragment I. a. I. But if card{A) = O..IorI..I and card(R.A) — 0..I then the 
corresponding set of representative fragments is 1.6, consisting of two fragments 
- 1. 6. 1 and 1.6.2. 

It is easy to see that, if in the enumeration N multiplicity a comes before 
multiplicity 6, then its set of representative fragments contains a fragment, that 
doesn’t satisfy 6. For example, the fragment l.b.l doesn’t satisfy multiplicity 2: 
0..1, the fragment l.b.2 doesn’t satisfy multiplicity 3: 1..*, and both of them 
don’t satisfy multiplicity 4: 1..1. 

Definition 8. The set of representative instances of CCD X for a given associ- 
ation end R.B is a set of instances of X, where for every representative fragment 
of card(R.B) there is one instance that contains it. 

For each of the ten representative fragments (Figure 0 we will prove that 
there exists an instance of the canonical CCD X that contains exactly the num- 
ber of objects of the two involved classes that we need for the fragment. Then 
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according to Lemma|0|we can reorganize links to get the instance containing the 
fragment. 

Fragment l.a.l. According to Tjemma. ITni co.rdtBl = O..*orl..*. Then accord- 
ing to Lemma 0 there is an instance containing 2 objects from the class A and 
2 objects from the class B. 

Fragment l.h.l. According to Tjemma. nTil mrdt Bl = O..*orl..*. Then accord- 
ing to Lemma 0 there is an instance containing 1 object from the class A and 2 
objects from the class B. 

Fragment l.h.2. According to LemmaElthere is an instance containing exactly 
1 object from every class. 

Fragment l.c.l. According to Lemma Elcard(i3) = O..*orl..*. Then accord- 
ing to Lemma 0 there is an instance containing 1 object from the class A and 2 
objects from the class B. 

Fragment l.e.2. If card{A) = 1..1, then according to Tyemma. ll II mrd( B) = 
O.AorO..*. Then according to LemmaQthere is an instance containing 1 object 
from the class A and no objects of the class B. 

If card{A) = 0..1, then according to Lemma 0there is an instance containing 
I object from the class A and no objects of the class B. 

Fragment 2.a.l. According to Lemma0there is an instance containing exactly 
1 object from every class. 

Fragment 2.b.l. According to Lemma 1121 there is an instance containing 2 
objects from the class A and 1 object from the class B. 

Fragment 2.c.l. If card{A) = 1..1, then according to Ijemma,)l II card! B) = 
0..1or0..*. Then according to LemmaQthere is an instance containing 1 object 
from the class A and no objects of the class B. 

If card{A) = 0..1, then according to Lemma 0there is an instance containing 

1 object from the class A and no objects of the class B. 

Fragment S.a.l. According to Ijemma. nTil mrrh'B'l = O..*orl..*. Then accord- 
ing to Lemma IT^ there is an instance containing 1 object from the class A and 

2 objects from the class B. 

Fragment S.h.l. According to Lemma[nilcarc?(A) = O..*orl..* and card{B) = 
0.. *orl..*. Then according to Lemma 0 there is an instance containing 2 objects 
from the class A and 2 objects from the class B. 

So, if multiplicities of some association end R.B differ, then CCDs are not 
equivalent. 

This concludes the proof of Theorem El 

8 Conclusion 

In the paper a reduction algorithm is presented that for a given composite class 
diagram produces the corresponding canonical composite class diagram. It uses 
a set of rules to eliminate the multiplicity inconsistencies in the diagram. 

We prove that two composite class diagrams are equivalent if and only if they 
can be reduced to the same canonical composite class diagram. 
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These presented algorithms can be used in CASE diagramming tools to detect 
multiplicity inconsistencies in class diagrams. 

Besides that we are planning to use these results in solving the ’’intuitive” 
equivalence problem for class diagrams. One way of defining the ’’intuitive” 
equivalence of class diagrams is via composite classes: two class diagrams we 
call ’’intuitively” equivalent, if, by replacing some classes with composite classes, 
they can be transformed into class diagrams that are strictly equivalent. 
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Abstract. We prove that both minimum and maximum traveling sales- 
man problems on complete graphs with edge-distances 1 and 2 are ap- 
proximable within 3/4. Based upon this result, we improve the stan- 
dard approximation ratio known for maximum traveling salesman with 
distances 1 and 2 from 3/4 to 7/8. Finally, we prove that, for any 
e > 0, it is NP-hard to approximate both problems within better than 
5379/5380 -f e. 

1 Introduction 

Given a complete graph on n vertices, denoted by K^, with edge distances ei- 
ther 1 or 2 the minimum traveling salesman problem (min_TSP12) consists in 
minimizing the cost of a Hamiltonian cycle, the cost of such a cycle being the 
sum of the distances on its edges (in other words, in finding a Hamiltonian cycle 
containing a maximum number of 1-edges). The maximum traveling salesman 
problem (max_TSP) consists in maximizing the cost of a Hamiltonian cycle (in 
other words, in finding a Hamiltonian cycle containing a maximum number of 
2-edges). A generalization of TSP12, denoted by TSPo6, is the one where the 
edge-distances are either a, or b, a < b. Both min_ and max_TSP12, and TSPa6 
are NP-hard. 

Given an instance / of an NP optimization (NPO) problem II and a polyno- 
mial time approximation algorithm A feasibly solving 7T, we will denote by tu(/), 
Aa(/) and (3{I) the values of the worst solution of /, of the approximated one 
(provided by A when running on I), and the optimal one for I, respectively. 
Gommonly (|S|), the quality of an approximation algorithm for an NP-hard 
minimization (resp., maximization) problem U is expressed by the ratio (called 
standard in what follows) pp,{I) = X{I)/P{I), and the quantity pn = infjr : 
Pk{I) < r, / instance of 77} (resp., pt, = sup{r : Pa{I) > r, 7 instance of 77}) 
constitutes the approximation ratio of A for 77. Another approximation-quality 
criterion used by many well-known researchers ( |2lli;a4ll4fn^ l is what in P 
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El we call differential- approximation ratio. It measures how the value of an ap- 
proximate solution is placed in the interval between lu{I) and P{I). More for- 
mally, the differential-approximation ratio of an algorithm A is defined as Si^(I) = 
|w(/) — A(/)|/|w(/) — /3(/)|. The quantity Sn = sup{r : Sf,{I) > r,I instance of 77} 
is the differential approximation ratio of A for 77. Another type of ratio, very 
close to the differential one, has been defined and used in |5|. There, instead 
of w(7), the authors used a value zr, called reference-value, smaller than a;(7). 
The ratio introduced in |5| is defined as (7 a(7^) = 1/3(7) — Aa(7)|/|/3(7) — zr\. The 
quantity |/3(7) — Aa( 7)| is called deviation of A, while |/3(7) — zr\ is called abso- 
lute deviation. For reasons of economy, we will call c?a(7) deviation ratio. For a 
given problem, setting zr = w(7), c/a(7) = 1 — <3a( 7) and both ratios have, as it 
has already mentioned above, a natural interpretation as the estimation of the 
relative position of the approximate value in the interval worst solution-value - 
optimal value. In |2j, the term “trivial solution” is used to denote the solution 
realizing the worst among the feasible solution-values of an instance. Moreover, 
all the examples in |2| carry over NP-hard problems for which worst solution can 
be trivially computed. This is for example the case of maximum independent set 
where, given a graph, the worst solution is the empty set, or of minimum vertex 
cover, where the worst solution is the vertex-set of the input-graph, or even of the 
minimum graph-coloring where one can trivially color the vertices of the input- 
graph using a distinct color per vertex. On the contrary, for TSP things are very 
different. Let us take for example min_TSP. Here, given a graph 7f„, the worst 
solution for 7f„ is a maximum total-distance Hamiltonian cycle, i.e., the optimal 
solution of max_TSP in 7F„. The computation of such a solution is very far from 
being trivial since max_TSP is NP-hard. Obviously, the same holds when one 
considers max_TSP and tries to compute a worst solution for its instance, as 
well as for optimum satisfiability, for minimum maximal independent set and 
for many other well-known NP-hard problems. In order to remove ambiguities 
about the concept of the worst- value solution of an instance 7 of an NPO prob- 
lem 77, we will defined it as the optimal solution opt(77') of an NPO problem 77' 
having the same set of instances and feasibility constraints as 77 verifying 

; / 7 -r/N f rnax opt(77) = min 
opt(77 ) = 7 . . 

I min opt(77) = max 

In general, no apparent links exist between standard and differential approx- 
imations in the case of minimization problems, in the sense that there is no 
evident transfer of a positive, or negative, result from the one framework to the 
other. Hence a “good” differential-approximation result signifies nothing for the 
behavior of the approximation algorithm studied when dealing with the stan- 
dard framework and vice-versa. When dealing with maximization problems, we 
show in HH that the approximation of a maximization NPO problem 77 within 
differential-approximation ratio S, implies its approximation within standard- 
approximation ratio S. 

The best known standard-approximation ratio known for min_TSP12 is 7/6 
(presented in [II 2)1. while the best known standard inapproximability bound is 
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5381/5380 — e, for any e > 0 (0). On the other hand, the best known standard- 
ratio max_TSP is 3/4 (HHI)- To our knowledge, no better result is known in 
standard approximation for max_TSP12. Furthermore, no special study of TSPa6 
has been performed until now (a trivial standard-approximation ratio or h/a 
of a/b is in any case very easily deduced for min_ or max_TSPa6). 

Here we show that min_ and max_TSP12 and min_ and max_TSPa6 are 
all equi-approximable within 3/4 for the differential approximation. We also 
prove that all these problems are inapproximable within better (more than) 
3475/3476 -I- e, for any e > 0. Finally, we improve the standard-approximation 
ratio of max_TSP12 from 3/4 ([I3|) to 7/8. 

In what follows, we will denote hy V = {ui, . . . ,Un} the vertex-set of 
by E its edge-set and, for ViVj £ E, we denote by d{vi,Vj) the distance of the 
edge ViVj £ E; we consider that the distance-vector is symmetric and integer. 
Given a feasible TSP-solution T{K„) of (both min_ and max_TSP have the 
same set of feasible solutions), we denote by d{T{Kn)) its (objective) value. Given 
a graph G, we denote by V{G) its vertex-set. Finally, given any set C of edges, 
we denote by d(C) the total distance of G, i.e., the quantity J2viVjeC 

2 Differential- Approximation Preserving Reductions 
for TSP12 

Theorem 1. min-TSP12, max_TSP12, miri-TSPab and max-TSPab are all 
equi-approximable for the differential approximation. 

Proof (sketch). In order to prove the theorem we will prove the following stronger 
quoted proposition. 

Consider any instance I = (iF„,d) (where d denotes the edge-distance 
vector of Kn). Then, any legal transformation d 'j.d-\-r].l of d /y, 77 £ 

Q/ produces differentially equi-approximable TSP-problems. 

Suppose that TSP can be approximately solved within differential-approxima- 
tion ratio (5 and remark that both the initial and the transformed instance 
have the same set of feasible solutions. By the transformation considered, the 
value d{T{Kn)) of any tour T{Kn) is affinely transformed into yd{T{Kn)) -\-ijn. 
Since differential-approximation ratio is stable under affine transformation, the 
equi-approximability of the original and of the transformed problem is immedi- 
ately deduced, concluding so the proof of the quoted proposition. 

In order to prove that min_TSPI2 and max_TSPI2 are equi-approximable it 
suffices to apply the proposition above with 7 = — 1 and 77 = 3. On the other 
hand, in order to prove that min_ or max_TSPI2 reduces to min_ or max_TSPo&, 
we apply the quoted proposition with 7 = 1/(6 — a) and rj = {b — 2a) /{b — a), 
while for the converse reduction we apply the quoted proposition with y = b — a 
and rj = 2a — b. Since the reductions presented are transitive and composable, 
the equi-approximability of the pairs (min_TSP12, max_TSP12) and (TSP12, 
TSPa6) proves the theorem. 
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3 Approximating Min TSP12 

Let us first recall that, given a graph G, a 2-matching is a set M of edges 
of G such that if V (M) is the set of the endpoints of M, the vertices of the 
graph (V{M),M) have degree at most 2; in other words, the graph {V{M),M) 
is a collection of cycles and simple paths. A 2-matching is optimal if it is the 
largest over all the 2-matchings of G. It is called perfect if any vertex of the 
graph (V (M),M) has degree equal to 2, i.e., if it constitutes a partition of V (M) 
into cycles. Remark that determining a maximum 2-matching in a graph G is 
equivalent to determining a minimum total-distance vertex-partition into cycles 
into G U G (the complement of G), where the edges of G are considered of 
distance 1 and the ones of G of distance 2. 

As it is shown in cni, an optimal triangle-free 2-matching can he computed 
in polynomial time. As it is mentioned just above, this becomes to compute 
a triangle-free minimum-distance collection of cycles in a complete graph K„ 
with edge-distances 1 and 2. Let us denote by M such a collection. Starting 
from M, we will progressively patch its cycles in order to finally obtain a unique 
Hamiltonian cycle in AT„. 

In what follows, for reasons of paper length’s constraints, lemmata 0 and El 
are presented without their proofs which can be found in m- 

3.1 Preprocessing M 

Definition 1. Let G± and G 2 be two vertex-disjoint cycles. Then: 

— a 2-exchange is any exchange of two edges viUi G Gi, V 2 U 2 G G 2 by the 
edges V 1 V 2 and U 1 U 2 ; 

— a 2-patching of G\ and G 2 is any cycle G resulting from a 2-exchange on G\ 
and G 2 , i.e., G = (Gi U G 2 ) \ {v\Ui,V 2 U 2 } U {v\V 2 ,uiU 2 } , for any pair 
(viUi,V2U2) G Gi X G 2 . 

A matching minimal with respect to the 2-exchange operation will be called 
2- minimal. 

Definition 2. A 2-matching M = (Gi, G 2 , . . . , G\m\) is 2-minimal if it verifies, 
V(Gi,Gj) G M X M, Gi ^ Gj, Vuiui G Gi, VU 2 U 2 G Gj, d{vi,V2) + d{ui,U2) > 
d{uivi) -\- d{u2V2). 

In other words, a 2-matching M is 2-minimal if any 2-patching of its cycles 
produces a 2-matching of total distance strictly greater than the one of M. 
Obviously, starting from a 2-matching M transformation of M into a 2-minimal 
one can be performed in polynomial time. Moreover, suppose that there exist two 
distinct cycles G and G' of M, both containing 2-edges and denote hy uv G G and 
u'v' G G' two such edges. Then, d{uu') -\- d{vv') ^ 4, while d{uv) -\- d{u'v') = 4, 
a contradiction. So, the following proposition holds. 

Proposition 1. In any 2-minimal 2-matching, at most one of its cycles contains 
2-edges. 
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Remark 1. If the size of a 2-minimal triangle-free 2-matching M is 1, then, since 
a Hamiltonian tour is a particular case of triangle-free 2-matching, M is an 
optimal min_TSP12-solution. Hence, in what follows we will suppose 2-matchings 
of size at least 2. 

Assume now a 2-minimal triangle- free 2-matching M = (Ci, . . . , Cp, Co), verify- 
ing remark ^ where by Cq is denoted the unique cycle of M' (if any) containing 
2-edges. Construct a graph H = Vh, Eh) where Vh = {tci, • • • , Wp} and contains 
a vertex per cycle of M' and, for i ^ j, WiWj G Eh iff 3(u,u) G Ci x Cj such 
that d{u, v) = 1. Consider a maximum matching Mh, \Mh \ — q, of H. With any 
edge Wi^Wjs of Mh we associate the pair {Cis,Cjs) of the corresponding cycles 
of M . So, M can be described (up to renaming its cycles) as 

q r—p—2q 

M=[j{ClCI} IJ {Cj|J{Co} (3.1) 

S = 1 t=l 

where for s = I,. . . ,q, 3e® G V{C() x F(C|) such that d{e'^) = 1. 

Consider M as expressed in expression (ft. Ill , denote by Vs the set of the 
four vertices of Cf and C| adjacent to the endpoints of e®, and construct the 
bipartite graph B = (V^ U Vq,Eb) where Vg = {wi, . . . , Wr} (i.e., we associate 
a vertex with a cycle Ct, t = 1, . . . ,r), Vg = {w^l, . . . (i.e., we associate 

a vertex with a pair ((71,(71), s = 1, . . . g) and, V(t, s), wtw’^ G Eb iff G (7t, 
3v G Vs such that d{u^v) = 1. Compute a maximum matching Mb, \Mb\ = q' 
in B. With any edge WtW^ G Mb we associate the triple ((7f , (7|, (7*). So, M can 
be described (up to renaming its cycles) as 

q q r'—r—q 

M={j{Cl,ClCi} U U {CaUiCo} (3.2) 

s— 1 s—q' + l t—1 

where for s = 1, . . . , q' , 3/® G V), x C((7|) such that d{f^) = 1. In what follows 
we will reason with respect to M as it has been expressed in expression d22l. 

3.2 Computation and Evaluation of the Approximate Solution and 
a Lower Bound for the Optimal Tour 

In the sequel, call s.d.e.p. a set of vertex-disjoint elementary paths, denote by 
PREPROCESS the algorithm of the achievement of M (expression 13. 211 1 following 
from the discussion of section lO and consider the following algorithm. 

BEGIN (*TSP12*) 

compute a 2-minimal triangle-free 2-matching M in Kn ; 

M ^ PREPROCESS(M) ; 

D G- 0; 

(1) FOR s ^ 1 TO q' DO 

let gf be the edge of C® adjacent to both e^ and f®; 
choose in C| an edge gj adjacent to e® ; 
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choose in C§ an edge g| adjacent to f®; 

D ^ D U q U C| U C§ \ {g^ g|, g§} U {e% f n ; 

OD 

(2) FOR s ^ q' + 1 TO q DO 

choose in C® an edge g^ adjacent to e® ; 
choose in Cj an edge gj adjacent to e® ; 

D^DUqUC|\{g^g|}U{en; 

OD 

(3) FOR t ^ 1 TO r' DO 

choose any edge gt in Ct ; 

D^DUCt\{gt}; 

OD 

(4) IF there exists in Co an 1-edge e 

THEN choose eui edge go of Cq adjacent to e; 

ELSE choose any edge go of Cq ; 

D "S— D U Co \ {go} ; 

FI 

(5) complete D in order to obtain a Hamiltonian tour T(Kn) ; 

OUTPUT T(Kn); 

END (*TSP12*) 

Both achievement of a 2-minimal triangle free 2-matching and the PREPROCESS 
of it, can be performed in polynomial time. Moreover, steps (1) to (4) are also 
executed in polynomial time. Finally, step 5) can be performed by arbitrarily 
ordering (mod|ZI|) the chains of the s.d.e.p. D and then, for i = 1, . . . , |H|, 
adding in D the edge linking the “last” vertex of chain i to the “first” vertex of 
chain i + 1. Consequently, the whole algorithm TSP12 is polynomial. 

Lemma 1. c?(T(iF„)) ^ d{M) + q + r' . 

Proof. During steps (1) to (4) of algorithm TSP12, set D remains a s.d.e.p. 
At the end of step (4), D contains M minus the 3g' + 2{q — q') + r' = q' + 

2q + r' l-edges of the set U^=i|gf , ^ 2 : ffll ^Lg'+i Iffi-Sll {9t} minus (if 
Co yf 0) one 2-edge of Cq plus the 2q' + {q — q') = q' + q 1-edges of the set 

cuPs=i{ef,/|}cMpf^g,+i{e^}. So D is a s.d.e.p. of size n - {q + r') - 
and of total distance d{M) — {q + r') — 2.1c^^f]f. Completion of D in order to 
obtain a tour in AT„, can be done by adding q + r' + lco 5^0 edges. Each 
of these new edges can be, at worst, of distance 2. We so have d(T(AT„)) < 
d{M) ~{q + r' + 2.1c^^^) + 2{q + r' + = d{M) +q = r', q.e.d. 

On the other hand, the optimal tour being a particular triangle-free 2-matching, 
the following lemma holds immediately. 

Lemma 2. /3(AT„) ^ d{M). 

3.3 Evaluation of the Worst- Value Solution 

In what follows in this section we will bring to the fore a s.d.e.p., all of its 
edges being of distance 2 (called 2-s.d.e.p.). Given such a s.d.e.p. W, one can 
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proceed as in step (5) of algorithm TSP12 C section l!01l . in order to construct a 
Hamiltonian tour whose total distance is a lower bound for oj{Kn). 

Denote by E2 the set of 2-edges of cycle Cq. If g = 0, i.e., Mh = 0, and if 
Co = E2, then the tour computed by TSP12 is optimal. 

Lemma 3. ((g = 0) A (Cq = E2)) (5Tspi2(^n) = 1- 

Proof. Let k = |y(Co)| = d{M) — n and set V{Co) = {ai,...,afc}. By the 
fact that M is 2-minimal, all the edges of Kn incident to these vertices have dis- 
tance 2. On the other hand, between two distinct cycles in the set {Ci, . . . , Cp=r'} 
of M, there exist only edges of distance 2. Consider the family 

By the remarks just above, any edge linking vertices of two distinct sets of T 
is a 2-edge. Any feasible tour of (a posteriori an optimal one) integrates 
the k + p sets of T by using at least k + p 2-edges pairwise linking these sets. 
Hence, any tour uses at least k + p 2-edges, so does tour T(A'„) computed by 
algorithm TSP12, q.e.d. 

So, we suppose in the sequel that g = 0 Co yf C2. We will now prove the 
existence of a 2-s.d.e.p. W of size d(M) + 4(g + r') — n, where M is as expressed 
by expression 

Proposition 2. Between two cycles Ca and Ct of M of size at least k, there 
always exists a path, alternating vertices of Ca and Cb, containing at least k 
2-edges. 

Proof. Let {oi, . . . , 0 ^+ 1 } and {bi , . . . , bk+i} be fc -P 1 successive vertices of two 
distinct cycles Ca and Cb of size at least k (eventually a\ = Ok+i if \V{Ca)\ = k 
and bi — bk+i if |H(Cf,)| = k). We will show that there exists a path, alternating 
vertices of Ca and Cb, of size 2fc — 1 and of distance at least 3A: — 1. Consider 
paths C = ujLi{ai6J {at+ibi} and D = '^i=i {aA+i}. By the 

2-minimality of M we get: Vi = 1, . . . , fc 

max (d (u^ ,bf^ , d (u^_|_]^ , } — 2 =P d (u^ “t“d(nj_|_x,6j_|_i) ^ 3 

and Vi = 1 , . . . , fc — 1 

max{d(ai,6i+i) ,d{ai+i,bi)} = 2^ d{ai,bi+i) -P d (0^+1, 6^) ^ 3 



Summing the terms of the expression above member-by-member, one obtains: 

k k—1 



(d (oi, bi) -P d (Oi+i, 6i+i)) + (d {a,+i,bi) -P d (a^, 6^+1)) ^ 6fc - 3 

6k -S' 



i—1 2=1 

d(C) -P d(D) ^ 6fc — 3 =P max {d(C), d(D)} ^ 



= 3fc- 1. 
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Application of proposition 0 in any pair of M results to the following. 

Claim 1. Vs = 1, . . . , (7, there exists a 2-s.d.e.p. of size 4, alternating vertices 
of cycles Cf and C|, containing a vertex of Vg whose degree with respect to 
is 1. 

In figure 0 we show an application of claim 0 We assume = 0262; then 
{ai,6i} C Vg. The 2-s.d.e.p. W® claimed is {6102, (0363, 0464), 0565} and the 
degree of bi with respect to W® is 1. 

Consider now the s.d.e.p. Wf = W®UW'(, where W® as in claim0and W'^ is 
any path of size 4 alternating vertices of C| and of Cj, s = 1, . . . ,q' , t = 1, ... ,r'. 
By the optimality of Mh, any edge linking vertices of CJ to vertices of Ct is a 
2-edge. Consequently, Wf is a 2-s.d.e.p. and the following claim holds. 

Claim 2. Vs = 1,. . . ,q' , Vt = 1, . . . ,r' , there exists a 2-s.d.e.p. Wf of size 8, 
alternating vertices of the cycles C{ and C|, and of the cycles C| and Ct. 

For s = q' . ,q, t = \, . . .r' , consider the triple (Cf , C|, Ct). Let e® = efe|, 

Vg = {rtf , rif, u|} and consider any four vertices at, bt, Ct and dt of Ct- By 

the optimality of Mb, any vertex of Ct is linked to any vertex of exclusively 
by 2-edges. Moreover, the 2-minimality of M implies that at least one of ufef 
and is of distance 2. If we suppose d(uf,ef) = 2 (figure 0), then the path 
{62, uf, at, vf,bt,U2,Ct,V2,dt} is a 2-s.e.d.p. In all, the following claim holds. 

Claim 3. Vs = g'-l- 1, . . . , g, V< = 1, . . . , r', there exists a 2-s.d.e.p. Wf of size 8, 
alternating vertices of the cycles Cf, C| and Ct. 

Let r' ^ 2 and consider the (residual) cycles Ct, t = 1, . . . ,r' . All edges between 
these cycles are of distance 2. If we denote by at, bt, Ct and dt four vertices of Ct, 
the path 

\^a\, ..., at, ... , aj.' , b\ , . . . , bt, . . • , bj.> , c\, . . . , Ct, . . . , Cj-f , d\, . . . , dt, . . . , dr ' } 

is a 2-s.d.e.p. of size 4r' — I and the following claim holds. 
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Fig. 2. The 2-s.d.e.p. Wi of claim Q 

Claim 4. If r' ^ 2, then there exists a 2-s.d.e.p. IF’' of size 4r' — 1 alternating 
vertiees of cycles Ct, t = 1, ... ,r' . 

Lemma 4. {{q ^ 1) V ((Cq C 2 ) A (r ^ 1))) ^ <5Tspi2(^n) ^ 3/4. 

Lemma Elis proved in HU- There, we use claims PI2lEland0 in order to bring to 
the fore a 2-s.d.e.p. W of total distance 4(g -h r') — + d{M) -n-\- 4 co^E2 ^ 

d{M) — n-\- 4(g -|- r'), the completion of which produces a Hamiltonian tour of 
distance d{M) {q r') at least. 

Lemma 5. {{q = 0) A (r = 1) A (Cq ^ C 2 )) Sjspi 2 {K„) > 3/4. 

In all, combining lemmata □ HE] and 0, the following theorem can be immedi- 
ately proved. 

Theorem 2. min-TSP12 is approximable within differential-approximation ra- 
tio 3 / 4 . 

Theorems Q and O induce the following corollary. 

Corollary 1. mim and max-TSP12 as well as mim and max-TSPab are ap- 
proximable within differential-approximation ratio 3/4. 

Consider two cliques and number their vertices by {1, ... , 4} and by {5, 6, . . . , n-\- 
8}, respectively. Edges of both cliques have all distance 1. Cross-edges ij, i = 1,3, 
j = 5, . . . , n-l-8, are all of distance 2, while every other cross-edge is of distance 1. 
Unraveling of TSP12 will produce: T = {1, 2, 3, 4, 5, 6, . . . , n -|- 7, n -|- 8, 1} (cycle- 
patching on edges (1, 4) and (5, n -\- 8)), while Tyj = {1, 5, 2, 6, 3, 7, 4, 8, 9 . . . , n -I- 

7, n -I- 8, 1} (using 2-edges (1, 5), (6, 3), (3, 7) and (n -|- 8, 1)) and T* = {l,2,n-|- 

8, n-l-7, . . . , 5, 4, 3, 1} (using l-edges (4, 5) and (2, n-|-8)). In figure El T* and T„, 
are shown for n = 2 (T = {1, . . . , 10, 1}). Consequently, <5TSPi2(^n-i-8) = 3/4 and 
the following proposition holds. 
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2 




T* and 



Fig. 3. Tightness of the TSP12 approximation ratio. 



Proposition 3. Ratio 3/4 is tight for TSP12 

Let us note that the differential approximation ratio of the 7 / 6-algorithm of ^2| , 
when running on iL„+8, is also 3/4. The authors of bring also to the fore a 
family of worst-case instances for their algorithm: one has k cycles of length four 
arranged around a cycle of length 2k. We have performed a limited comparative 
study between their algorithm and ours one, for fc = 3, 4, 5, 6 (on 24 graphs). The 
average differential and standard approximation ratios for the two algorithms 
are presented in tabled Using corollary ^ and the facts that w(iL„) ^ bn and 
(3{Kn) ^ an, the following holds. 



Table 1. A limited comparison between TSP12 and the algorithm of on some 
worst-case instances of the latter. 





k 


TSP12 


The algorithm of jI2I 


.2 


3 


0.931100364 


0.846702091 


— 


4 


0.9000002 


0.833333 


0) 


5 


0.920289696 


0.833333 


5 


6 


0.9222222 


0.833333 


_o 








u 


3 


0.923350955 


0.87013 


T3 


4 


0.9094018 


0.857143 


cS 

T3 


5 


0.92646313 


0.857143 


ci 


6 


0.928178 


0.857143 



Proposition 4. miu-TSPab is approximable within p ^ (1 -I- (6 — a/4a) in the 
standard framework. This ratio tends to oo when b = o(a). 
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Revisit min_TSP12. Using n ^ l3{Kn) ^ uj{Kn) ^ 2n, one can see that ap- 
proximation of min_TSP12 within S = 1 — e implies its approximation within 
p = 2— (1 — e) = l-|-e, O^e^l. Using the result of 0 and theorem 0 one 
gets the following differential-inapproximability result. 

Theorem 3. min- and max-TSPab and min- and max-TSPVl are inapprox- 
imable within differential-ratio greater than, or equal to, 5379/5380 -I- e, Ve > 0, 
unless P—NP. 

4 An Improvement of the Standard Ratio for the 

Maximum Traveling Salesman with Distances 1 and 2 

Combining expressions (5max_TSPi2 ^ 3/4, Wmax(-ffn) ^ an and /3max(-f^^n) ^ bn, 
one deduces Pmax_TSPi 2 ^ (3/4) -I- (a/46). Setting o = 1 and 6=2, the following 
theorem immediately holds. 

Theorem 4. max-TSP12 is polynomially approximable within standard- appro- 
ximation ratio bounded below by 1/8. 

The algorithm of H2! in Kn solves max_TSP on ifT„ within standard-approxi- 
mation ratio bounded below by 2/3. 

Note that standard-approximation ratio 7/8 can be obtained by the following 
direct method. 

BEGIN /max_TSP12/ 

find a triauigle-free 2-matching M = {Ci, C 2 , . . .} ; 

FOR all Cl DO delete a minimum-distance edge from Ci DD 
let Mt the collection of the paths obtained; 
properly link the paths of Mt to get a Hamiltonian cycle T; 
OUTPUT T; 

END. /max_TSP12/ 

Let p be the number of cycles of M where 2-edges have been removed dur- 
ing the FOR-loop of algorithm max_TSP12. Then, Amax_TSPi 2 (^n) ^ d{M) — p, 
(3{Kn) < d{M) and, since M is triangle-free, d{M) ^ 8p. Consequently, 
'^max_TSP12 (Kn)// max (K„) ^ 7/8. 
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Abstract. The intention of the paper is first to show the applicability 
of the general categorical framework of open maps to the setting of true 
concurrent models with dense time. In particular, we define a category 
of timed event structures and an accompanying path (sub)category of 
timed words. Then we use the framework of open maps to obtain an ab- 
stract notion of bisimulation which is established to be equivalent to the 
standard notion of timed bisimulation. Using the fact, we finally show 
decidability of timed bisimulation in the setting of finite timed structures. 

Keywords: category theory, timed event structures, timed bisimulation 



1 Introduction 

As a response to the numerous models for concurrency proposed in the literature 
Winskel and Nielsen have used category theory as an attempt to understand the 
relationship between models like event structures, Petri nets, trace languages 
and asynchronous transition systems m- Further, to provide a convincing way 
to adjoining abstract equivalences to a category of models, Joyal, Nielsen, and 
Winskel proposed the notion of span of open maps uni that is an abstract 
generalization of Park and Milner’s bisimulation. Furthermore, in m open maps 
have been used to define different notions of bisimulation for a range of models, 
but none of these have modelled real-time. 

Recently, the demand for correctness analysis of real time systems, i.e. sys- 
tems whose descriptions involve a quantitative notion of time, increases rapidly. 
Timed extensions of interleaving models have been investigated thoroughly in 
the last ten years. Various recipes on how to incorporate time in transition sys- 
tems — the most prominent interleaving model — are, for example, described 
in nEi. Timed bisimulation was shown decidable for finite timed transition 
systems by Cerans in @, and since then more efficient algorithms have been 
discovered in fnETi) . 

* This work is partially supported by the Russian Fund of Basic Research (Grant N 
00-01-00898). 
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On the other hand, the incorporation of quantitative information into non- 
interleaving abstract models has received scant attention: a few extensions are 
known of causal trees 0, pomsets 0, configurations sets of posets El , net 
processes m, and event structures Tjm . 

The contribution of the paper is first to show the applicability of the general 
categorical framework of open maps to the setting of true concurrent models 
with dense time. Here we define a category of timed event structures, where 
the morphisms are to be thought of as simulations, and an accompanying path 
(sub)category of timed words, which, following |1()|. provides us with notions 
of open maps and bisimulation. Furthermore, we show within the framework of 
open maps that timed bisimulation is decidable for finite timed event structures. 

There have been several motivations for this work. One has been given by 
the papers um]m\ which have proposed and investigated categorical charac- 
terizations of event structure models. A next origin of this study has been a 
number of papers (see | |5lldl2()| among others), which have extensively stud- 
ied time-sensitive equivalence notions for interleaving models. However, to our 
best knowledge, the literature of timed true concurrent models has hitherto 
lacked such the equivalences. In this regard, the papers is a welcome ex- 

ception, where the decidability question of timed testing has been treated in the 
framework of event structures with time notions. Finally, another motivation has 
been given by the paper P], which provides an alternative proof of decidability 
of bisimulation for finite timed transition systems in terms of open maps, and 
illustrates the use of open maps in presenting timed bisimularity. 

The rest of the paper is organized as follows. The basic notions concerning 
timed event structures are introduced in the next section. A category of timed 
event structures and an accompanying path (sub)category of timed words, are 
defined in section 3. A notion of TLF-open morphisms and its alternative char- 
acterization are provided in section 4. In section 5, basing on spans of TLF-open 
maps, the resulting notion of bisimulation is studied, and shown to coincide with 
the standard notion of timed bisimulation. Section 6 is devoted to decidability of 
timed bisimulation in our framework. Section 7 contains conclusion and future 
work. 

2 Timed Event Structures 

In this section, we introduce some basic notions and notations concerning timed 
event structures. 

We first recall a notion of an event structure m- The main idea behind 
event structures is to view distributed computations as action occurrences, called 
events, together with a notion of causality dependency between events (which 
reasonably characterized via a partial order). Moreover, in order to model non- 
determinism, there is a notion of conflicting (mutually incompatible) events. A 
labelling function records which action an event corresponds to. 

Let Act be a finite set of actions. A (labelled) event strueture over Act is 4- 
tuple S = (A, <,#, 1), where if is a countable set of events; < C if x if is a partial 
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order (the causality relation), satisfying the principle of finite causes: \/e € E » 
{e' G E \ e’ < e} is finite; ff C E x E is a symmetric and irreflexive relation 
(the conflict relation), satisfying the principle of conflict heredity: Ve, e', e" G E o 
e ff e' < e" => e ffe" ] I : E — >■ Act is a labelling function. 

Let C C E. Then C is left-closed iff Ve, e' G E » e G C A e'<e=>e'sC; 
C is conflict-free iff Ve, e' G C o ~<{e ff e'); C is a configuration o/ S' iff C is 
left-closed and conflict-free. Let C{S) denote the set of all finite configurations 
of S. For C G C(S), we define En{C) = {eGE\CA{e}G C(S)}. 

Next we present a model of timed event structures which are a timed exten- 
sion of event structures by associating their events with two timing constraints 
that indicate earliest and latest event occurrence times both with regard to global 
clock. Events once ready — i.e., all their causal predecessors have occurred and 
their timing constraints are respected — are forced to occur, provided they are 
not disabled by others events. A state of a timed event structure is a set of its 
event occurred, equipped with a set of clocks corresponding to the events and 
recording a global time moment at which events occur. A timed event structure 
progresses through a sequence of states by occurring events at a certain time mo- 
ment. An event occurrence takes no time. Let N be the set of natural numbers, 
R^J" the set of nonnegative real numbers. 

Definition 1 A timed (labelled) event structure over Act is a tuple TS = 
{S, Eft, Lft), where 

— S is a (labelled) event structure over Act; 

- Eft, Lft: E^ R+ are functions of the earliest and latest occurrence times 
of events, satisfying Eft(e) < Lft{e) for all e G E. 

In a graphic representation of a timed event structure, the corresponding 
action labels and time intervals are drawn near to events. If no confusion arises, 
we will often use action labels rather event identities to denote events. The 
<-relations are depicted by arcs (omitting those derivable by the transitivity), 
and conflicts are also drawn (omitting those derivable by the conflict heredity). 
Following these conventions, a trivial example of a labelled timed event structure 
is shown in Fig. [U 



TSi : 

[0,1] [2,3] 

ao : eo 02: 62 



# 



ai : ei 

[ 1 , 2 ] 

Fig. 1. 



Let E{TS) = [E ^ R+] be the set of time assignments for events from 
E. Given v G EifTS), we let A(v) = max{z/(e) ] e G E}. A state of TS is a 
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pair {C,v), where C € C{S) and v € r{TS). The initial state of TS is a pair 
{Cs, vts) = (0, 0). 

In a state {C^v), the occurrence of an event e after passing a time d G 

Rg leads to the state (denoted {C,n) {C',v')), if C = C U {e}, 

|B\{e}= V, v'{e) = A{v)+d and Eft{e) < v'{e) < mm{Lft{e') \ e' S En(C)}. 

We shall write (C,v) if {C ,v') and l{e) = a. A state 

(C, v) is reachable iff either (C, v) = (Cs, r'Ts) or there exists a reachable state 

(C',C) such that (C',i/') (C, i^) for some e G E and d G Rg . Let RS{TS) 

denote the set of all reachable states of TS. 

A timed word w of an alphabet Act over R^j" is a finite sequence of pairs: w = 
{ai,di){a 2 , d^) ■ ■ ■ {an, dn), where G Act and di G R^J" for all 1 < * < n. Given a 
timed word w = (oi, di)(a 2 , c? 2 ) • • ■ {an, dn), a run r of rc is a finite sequence of the 

form: r = (0,0) (Ci,:^i) {C 2 , 1 ^ 2 ) . . . {Cn-i, i^n-i) (C„,j^„). 

As an illustration, we construct the set of the timed words corresponding to 
the runs of the timed event structure TSi (see Fig. 1): {(ao,do) | 0 < do < 1} 
U {(ai,di) I di = 1} U {(tto, do)(flii di) | 0 < do < 1, 1 < do + di < 2} U 
{(oi, di)(oo, do) I di = 1, do = 0} U {(oq, do)(o 2 , d 2 ) | 0 < do < 1, do + d 2 = 2}. 

3 A Category of Timed Event Structures 

In this section, we define a category of timed event structures and an accompa- 
nying path (sub)category of timed words. 

The morphisms of our model category will be simulation morphisms following 
the approach of mu. This leads to the following definition of a morphism that is 
a function, mapping events of the simulated system to simulating events of the 
other, satisfying some requirements. 

Definition 2 A morphism between timed event structures TS = {S = {E, <, 
#, 1), Eft, Lft) and TS' = {S' = {E' , <' , , I'), Eft', Lft'), fx:TS^ TS', 

is a function fa : E ^ E' such that: 

— I' o fj, = 1; 

— C G C{S) ^ fi C G C{S') and the following constraints hold: 

• Ve, e' G C o fi{e) = fi{e') => e = e' ; 

• Ve G C o Eft'{fj,{e)) < Eft{e); 

• min{L/t(e) | e G En{C)} < mm{Lft'{e) \ e G En{fa C)}. 

Here we assume that the minimal value of the empty set is equal to — 00 . 

As an illustration, consider a morphism from the timed event structure T S 2 
in Fig. 2 to the timed event structure TSi in Fig. 1 mapping events e( to Ci 
(0 < i < 2). It is easy to check that the constraints in Definition 2 are satisfied. 

Let us consider a simulation property of a morphism defined prior to that. 
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TS 2 ■■ 



[0,1] [2,3] 

ao : 6 q a2 ■ 




Fig. 2. 



Theorem 1 Given a morphism jjL \ TS ^ TS' and a timed word {ai,di) ... 
(a„,d„). If{Cs,^Ts) = (Co,^^o) ... (C„,z/„) 

is aruninTS, then {C s' , vt S') = (mC'di^'o) {p,Ci,v[) ... {p. Cn-i,v'n-i) 

( _r^n) ^ T5". 



Proof Sketch. We will prove this theorem by induction on n. 

As base case, we have the empty run with just one initial state. The result is 

(ai,di) 



obvious. For the induction step, assume (Cs,vts) = (C'oi^'o) iCi,vi) . . . 

{Cn-i,i^n-i) {Cn,Vn) in TS, and {Cs'^vts') = (fi Cq,v'^) Ci,v[) 

_ Gn-i4n-i) Cn-i,i^n-i) ill TS'. According to the definition of the relation 



(ar^n)^ there exists e„ G Es such that /(e„) = o„, C„ = Cn-iUjen}, Vn |£;\{e„} = 
Vn-I, Vn{en) = A{vn-i) + d„, and Eft{en) < Vn{e„) < min{L/t(e) | e G 
En{Cn-i)}. Since ^ is a morphism, then p, C„ G C{S'). We now check 

that {p, Cn\ pt Cn-i) = {/x(e„)}. Suppose a contrary, i.e., ii{en) G pi C„_i. 
Then there is an event e' G Cn-i such that pi{e') = pi{en). From the definition 
of a morphism we get e! = e„, but it is impossible because e„ ^ Cn-i- Hence 
{pi Cn\ pi Cn-i) = {/^(cn)}- Again, from the definition of a morphism we have 
/'(/r(e„)) = ?(e„) = a„, Eft'{pi{en)) < Eft{en) < = A{iy!^_^) + 

dn < min{L/t(e) | e G En{C„,-i)} < min{L/t'(e) | e G En{pi C„_i)}. Thus, 

{pi Cn-i,v'„_i) (m with iy'„ |£;.\{^(e„)}= <_i, and z/;(/r(e„)) = 

A{i/'^_i) + dn, by the definition of the relation □ 



Hence, in the formal sense of Theorem 1 we have shown that the morphisms 
from Definition 2 do represent a notion of simulation. So, define a category of 
timed event structures as follows. 



Definition 3 Timed event structures (labelled over Act) with morphisms be- 
tween them form a category of timed event structures CTSaci, in which the 
composition of two morphisms pii : TSq — > TSi and pi 2 : TSi — > TS 2 is 
{pi 2 o pii) : TSo — > TS 2 , and the identity morphism is the identity function. 



Proposition 1 CTSaci is a category. 
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Following the standards of timed event structures and the paper UDI , we 
would like to choose timed words with word extension so as to form a subcategory 
of CTSAct- For each timed word w, we shall construct a timed event structure 
as follows. 

Definition 4 Given a timed word w = (oi, di)(a 2 , ^ 2 ) ■ • ■ {o-mdn), we define a 
timed event structure TSw = {E, <, #, I, Eft, Lft) as follows: E = {1,2, ■ ■ ■ , n}; 
< = {(bi) & E X E I 1 < i < j < n}; jl = 0; l{i) = at (i = 1,2- --n); 
Eft{i) = Lft{i) = di + . . . + di. 



The purpose of the construction is to represent the category of timed words 
with extension inside CESAct, and to identify runs oi w in TS with morphisms 
from TSw to TS, as expressed formally in the following two results. 

Proposition 2 The construction of the timed event structure TS^ from a timed 
word w extends to a full and faithful functor from the category of timed words 
(as objects) and word extensions (as morphisms) into CTSAct- 



Theorem 2 Given a timed event structure TS and a timed word w = (ai,di) 
... (a„,d„). For all run of w in TS, {Cs,vts) = {Co, 1^0) ... 

{Cn-i,Vn-i) ^ {Cn,Vn) such that Ci \ Ci-i = {ci} (0 < i < n), we can 

associate a morphism p. : TS^ TS such that p{i) = e^. Furthermore, this 
association is a bijection between the runs ofw inTS and morphisms p : TS^, — t 
TS. 

Proof Sketch. From the definition of a run and the construction of TS^ it 
follows that p as defined is indeed a morphism. 

Next, assume p : T S^ ^ T S to he a, morphism. We associate with p the run 

r of w as follows: r = {Cs, vts) = {Co, vo) {Ci, vi) . . . {Cn-i, Vn-i) 

{Cn, Vn), where Ci = p {1, . . . , f}. According to the definition of a morphism, r 
is indeed a run of w in TS. It is easy to check that the correspondence given 
above is one to one. □ 



4 TVF-Open Morphisms 

Given our categories of timed event structures and timed words, we can apply 
the general framework from defining a notion of TLF-open maps. 

Definition 5 A morphism p : TS ^ TS' in CTSAct is TW -open iff for all 
timed words w and w' , and morphisms such that the following diagram com- 
mutes: 
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TS,. 



TS 



TSu 



TS' 



fi" 



TS„ 



there exists a morphism p, : TS^' TS such that in the diagram 

TS 



h- 




p" 

the two triangles commute. 

Our next aim is to characterize TVE-openness of morphisms. 

Theorem 3 Let (Ci,vi) and {p C\,v'i) he reachable by w in TS and TS', re- 
spectively. A morphism p : TS — >■ TS' is TW-open iff whenever {p 
{C^, i/^) in TS' then (Ci, i/i) (C2, 1^2) in TS and p C2 = C'^. 

Proof Sketch. It follows similar lines as other standard proofs of the charac- 
terization the TIT-openness of a morphism (see e.g., |2j), using the definition of 
a morphism and Theorem 2. □ 

We do not require for the category CTSAct to have pullbacks. The following 
weak result suffices. 

Theorem 4 Given two TW-open morphisms pi : TSi ^ TS and p2 : TS2 ^ 
TS. There exists a timed event structure TSx and TW-open morphisms p'l : 
TSx ^ TS\, p'2 : TSx ^ TS2 such that the diagram commutes: 

L2 

TSx TS2 



Ml 



M2 



TSi 



TS 



Ml 



Proof Sketch. As a first step we construct a structure TSx = {Sx,Eftx, Lftx) 
as follows: 
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- Sx = + (S'cixC2 I Ci € C{Si), 3C € C{S) o C is maximal configuration in S, 
C = Ci {i = 1,2)) with ^CixCs = {ECixC2, <CixC2, #CixC2i hixC2), 
where 

• Ec^xC2 — {(ci) 62) S Cl X C2 I 3 e G C o e = {i = 1 , 2 )}; 

• (61,62) <CixC2 (6i,62) <t4> <i e' for some i G {1,2}; 

• #cixC2 = 0; 

• ^CixC2((ei,62)) = h{ei) for some i G {1,2}; 

- Eftx{ei,e2) = max{C/ti(ei),C/t2(62)}; 

- Lftx{ei,e2) = min{L/ti(ei),L/t2(62)}. 

Notice that the algebraic operation + is usually ‘interpreted’ by indicating that 
all events in one component are in the ^-relation with all events in the others 
(see 0 for more explanation). 

It is straightforward to show that TSx is a, timed event structure. 

Next, define maps /r} and as follows: /r'(ei, 62) = Ci {i = 1, 2). It is easy to 
check that /i{ and ^'2 are indeed morphisms. From the definition of a morphism 
and the construction of TSx, it immediately follows that /ri o = ^2 o 

Finally, we check TLF-openness of /i} (checking TLF-openness of /i'2 is simi- 
lar). Assume (Cs^.x^rs,.) {Cx,Vx) in TSx, and (^{ Cs^,i/tSi) 

(0^1) ^ ^ ^ (a^jd„) (M) in TS*!. According to Theorem 1, we get 

Cs^,i^Ts) (a^ioMi Cx,iy) ifJ-i C[,ix') in TS. Again, 

by Theorem 1 it holds {^2 Cs„,vtS2) C^,, 1^2) in TS'2. Using 

the fact that the diagram above is commutative, we have /xi o = fj,2° fJ-2- Since 

fj,2 is a TLF-open morphism, then {fi'2 Cx, 1^2) (C2, 1^2) in TS2, and /X2 C2 = 

fj.1 C'l, by Theorem 3. Suppose {e}} = (C{ \ a^i Cx), and {e^} = (C2 \ n'2 Cx)- 

Then from the construction of TS'2;, it follows {Cx,Vx) ((Ca;U{(e'i, e^}), ^'{). 
By the definition of a morphism, we obtain a(} {Cx U {(e}, e^)}) = C[. Hence Ai{ 
is a TLF-open morphism, by Theorem 3. □ 

5 Timed Bisimulation 

In this section, we first introduce a notion of TLF-bisimulation, using the concept 
of TLF-open maps. Then the standard notion of timed bisimulation is defined in 
terms of states of timed event structures. Finally, the coincidence of the bisim- 
ularity notions is shown. 

As was reported in cni, the open map approach provides a general concept 
of bisimularity for any categorical model of computation. The definition is given 
in terms of spans of TLF-open maps. 

Definition 6 Timed event structures TSi and TS2 are TW -hisimilar iff there 
exists a span TS\ TS TS2 with vertex TS of TW-open morphisms. 

Notice that it follows from m and Theorem 4 that TLF-bisimulation is 
exactly the equivalence generated by TLF-open maps. 
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Next, the notion of timed bisimulation is defined in terms of reachable states 
of timed event structures as follows. 

Definition 7 Two timed event structures TSi and TS2 are timed bisimilar iff 
there exists a relation B C RS(TSi) x RS{TS2), satisfying the following condi- 
tions: ((Csj, (C'sai ^ ^ and for all {{Ci,ni), (02,1^2)) & B it holds: 

(a) if (Ci,iyi) in TSi, then (02,^2) (C'2j^^2) '^^2 <md 

aC[,n[), iy'2)) G B for some (C^, 12'ff G RS{TS2); 

(b) if (C'2,t'2) (^*2:^2) TS2, then (Ci,:^i) {C[tv[) in TS\ and 

for some & RSiffS^). 

Finally, the coincidence of the bisimularity notions is established. 

Theorem 5 Timed event structures TSi and TS2 are TW -bisimilar iff they are 
timed bisimilar. 



Proof Sketch. 

(=i>) Let TSi 4^ TS TS2 be a span of TbF-open maps. Define a relation 
B as follows: B = {((Ci, z^i), (C2, :/2)) I G RS{TS,), 3{C,iy) G RS{TS) o 

fii C = Ci, ViO yii = V {i = 1, 2)}. Using Theorem 3, it is a routine to show that 
is a timed bisimulation. 

(<J=) Assume TS\ and TS2 to be timed bisimilar with a relation B as defined 
in Definition 7. We construct a span of TbF-open maps with a vertex TS = {S, 
<, #, I, Eft, Lft) defined as follows: 



- S' = + (ScixC2 I 3 ((Ci, vi), {C2, V2)) G yB o Ci is a maximal configuration in 
Si, z = 1, 2) with 

• = {(ei,e2) G Cl X C2 I 3 C', C" C Q o 

((Cf,<), {C'f,vlf)) G B, (C',:r') (Cf,uf), z = 1 , 2 , Iffe,) = ^2(62)}; 

• (61,62) <CixC2 (6i,62) Ci <i e' for some z G { 1 , 2 }; 

• #CixC 2 = 0; 

• ^CixC2((ei, 62)) = l\{ei) = ^2(62); 

- C/t(ei,e2) = max{C/ti(ei), C/t2(62)}; 

- i/t(ei,e2) = min{L/ti(ei), Lft2{e2)}. 



It is straightforward to show that TS is a timed event structure. 

Next, define maps fii : TS — >■ TSi as follows: /Zi((ei,62)) = e* (z = 1,2). 
By the construction of TS we obtain that /zi and /Z2 are indeed morphisms. 
We shall check TlU-openness of /zi (TlU-openness of /i2 is proved in a sim- 
ilar way). Assume {Cs,vts) ... (C, zz) in TS. According to 

Theorem 1, we get: (^1 Cs,vtSi) ... C,vi) in TSi, and 

(M2 Cs, VTsff (^2 C, V2) in TS2. Suppose (/Xi C, v^) {C[,v[) 

in TS\. By the definition of timed bisimulation, we have: ((/xi C, z^i), (/i2 C, V2)) G 

B, (ax2 0,^2) (C2,i^2). and ((C(, zz{), (C2, zx^)) G B, for some (C2,zx^) G 




296 N.S. Moskaljova and LB. Virbitskaite 

RS{TS2)- Let {ei} = {C'l \ C), and {62} = {C'2 \ fJ.2 C). From the construc- 

tion of TS', it follows that (ei, 62) G Es and C = CU{(ei, 62)} is a configuration 

in S. Again, from the construction of TS we get (C, {C ,v') in TS, and 

fjLi C = C[. Thus, Hi is a TLF-open morphism, by Theorem 3 . □ 



TSi : 



[ 0 , 1 ] [ 0 , 1 ] 

# 

T T 

b # b a 

[ 0 , 1 ] [ 0 , 1 ] [ 0 , 1 ] 



TS4, : 



[ 0 , 1 ] 

a 



b # b 

[ 0 , 1 ] [ 0 , 1 ] 

Fig. 3. 



TS5 : 



[ 0 , 1 ] 

a 



b 

[ 0 , 1 ] 



The timed event structures TSi and TS2, shown in Fig. 1 and 2 respectively, 
are not bisimilar. Next, consider the timed event structures in Figure 4 . It is 
easy to see that there are morphisms from TS3 to TS4 and to TS'5, and these 
morphisms are TLF-open. Hence we have a span of TLF-open maps between TS4 
and TS3. Timed bisimularity between TS4 and TS^ follows from Theorem 5 . 

6 Decidability 

In this section, we consider decidability questions for TLF-openness of a mor- 
phism and for timed bisimulation in the setting finite timed event structures, i.e. 
structures with finite set of events and for which all constants referred to in the 
earliest and latest occurrence times of events are natural valued. The subclass 
of timed event structures is denoted by T 5 n. 

As for many existing results for timed models, including results concerning 
verification of real-time systems, our decision procedure relies heavily on the idea 
behind regions (equivalence classes of states) of P, which essentially provides a 
finite description of the state-space of timed event structures. 

Given a timed event structure TS and i/, G T{TS), we let 1 / i/' iS (i) 

for each e G if it holds: [v{e)\ = \ v'[e)\, and (ii) for each e,e' £ E it holds: 
liy{e)l < lu{e')l O lv'{e)l < lv'{e')l, and lv{e)l = 0 = 0 . Here, for 

d G RJ, Idl and [dj denote its fractional and smallest integer parts, respectively. 
For V G r{TS), let [v] denote the region to which it belongs. Given TS G T 5 n, 
an extended state of T 5 ' is defined as a pair (C, [v]) with {C,v) G RS{TS). We 
consider {Cs, [ 0 ]) as the initial region of TS”. 



Proposition 3 Let TS £ T 5 n- 
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(a) Given an event e and a region [n], 

Eft{e) < v{e) < Lft{e) Vz/i G [v] o Eft{e) < ni{e) < Lft{e). 

(b) Given extended states (C, (C", [E]), 

(C, v) {C, u') Vi^i G [v] o (C, vi) (C", v'l) for some iy[ G [E] and 
d' G R^}-. 

For extended states {C ,[v]) , {C ,[v']) , we shall write {C,[iy]) {C ,[v']), 

if {C,v) {C',v'). An extended state {C,[v]) is called reachable by a timed 

word w, if (C, n) is reachable by a timed word w. 

We can now give a characterization of TVF-open maps in terms of extended 
states. 

Theorem 6 Let TS\,TS 2 G T5n, and (Ci, [i^i]), Ci, [v'f\) be extended states 
reachable by w inTSi andTS 2 , respectively. A morphism p. \ T S\ ^ T S 2 is TW- 

open iff whenever {p Ci, [ly'i]) {C' 2 , [E 2 ]) in TS 2 , then {Ci, [vi]) (C 2 , [ 1 ^ 2 ]) 

in TSi and /t C '2 = 

Proof Sketch. It follows from Theorem 3 and Proposition 3. □ 

Notice, that the theorem above implies decidability of TIF-openness of a 
morphism between TS and TS' from T5 n- 

Theorem 7 Let TS\,TS 2 G T5n- If there exists a span of TW -open maps 
TSi -<P— TS -ELf TS 2 then there exists TS' G T5n of size bounded by the size 

ofTSi and TS 2 and with TW-open morphisms TS\ 4^- TS' rS' 2 . 

Proof Sketch. Since TS\ 4^— TS rS '2 is a span of TW-open maps, then 
TSi and TS 2 are TIF-bisimilar, by Theorem 5. This means that there exists 
a timed bisimulation B between states of TSi and TS 2 . Using B we construct 
TS' as in the converse part of the proof of Theorem 5. From the construction 
it follows that TS' G TSn- The number of extended states in TS' is bounded 
by I if |! • • (c + where | if | = | ifi | * | A 2 | (| Ci | is the number of 

events in TSi {i = 1,2)), and c is the greatest constant appearing in the timing 
constraints in TS\ and TS' 2 . □ 

Corollary 1 Timed bisimulation is decidable for TS,TS' G T5n- 

7 Conclusion 

In this paper, we tried to develop a categorical framework of open maps for timed 
event structures to take into account processes’ timing behaviour in addition to 
their degrees of relative concurrency and nondeterminism, and then to apply the 
framework to characterize and, also, to decide timed bisimulation. 

In a future work, we will extend the results obtained to other behavioural 
equivalences of timed systems (e.g., timed variants of pomset based equiva- 
lences). Some investigation on the existence of coreflection between timed true 
concurrent models (timed net processes and timed event structures) are now 
under way and we plan to report on this work elsewhere. 
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Abstract. Polynomial time partial information classes are extensions 
of the class P of languages decidable in polynomial time. A partial in- 
formation algorithm for a language A computes, for fixed n £ N, on 
input of words xi, . . . ,x„ a set P of bitstrings, called a pool, such that 
Xa(xi, . . . , Xn) G P, where P is chosen from a family P of pools. A 
language A is in P[H], if there is a polynomial time partial information 
algorithm which for all inputs (xi, . . . ,x„) outputs a pool P £ P with 
Xa{xi, . . . , Xn) G P- Many extensions of P studied in the literature, in- 
cluding approximable languages, cheatability, p-selectivity and frequency 
computations, form a class P\P] for an appropriate family P. 

We characterise those families P for which P\P] is closed under certain 
polynomial time reductions, namely bounded truth-table, truth-table, 
and Turing reductions. We also treat positive reductions. A class P[2?] is 
presented which strictly contains the class P-sel of p-selective languages 
and is closed under positive truth-table reductions. 

Keywords: structural complexity, partial information, polynomial time 
reductions, verboseness, p-selectivity, positive reductions. 



If a language A is not decidable in polynomial time one may ask whether it never- 
theless exhibits some polynomial time behaviour. If there is no polynomial time 
algorithm that answers the question “a; G A ?” for all inputs x, there may still ex- 
ist a partial information algorithm. For a tuple of input words (xi, . . . , Xn) such 
an algorithm outputs some partial information on membership of these words 
with respect to A. More precisely it narrows the range of possibilities for values 
of xa{x\, . ■ . ,Xn), where XA is the characteristic function for A. Many types of 
partial information have been studied, including verboseness, approximability, 
(strong) membership comparability, cheatability, frequency computations, eas- 
ily countable languages, multiselectivity, sortability. For detailed definitions and 
discussions of these notions see for example [2111611 5liSll4ll3j . 

To get a more unified picture, ^ introduced the recursion theoretic notion of 
P-verhoseness, where the type of partial information is specified by a family P of 
sets of bitstrings. The class of polynomially verbose languages, whose formal 
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definition given below, is denoted Basic properties of these polynomial 

time 2?-verboseness classes are presented in m- 

Reduction closures of partial information classes have been the focus of much 
interest, mostly due to the fact that the polynomial time Turing reduction closure 
of P-sel, the class of p-selective sets, is exactly P/poly. Hence, a language is Turing 
reducible to a p-selective language iff the language has polynomial size circuits. 
It is also known that P-sel is closed under positive truth-table reductions, but 
not under 1-tt reductions. Reductions to p-selective sets have been studied in 
detail in H2!- Opposed to selectivity, cheatability behaves quite differently: The 
class of n-cheatable languages is known P| to be closed under Turing reductions. 
For other notions like strongly membership comparable sets eg only little was 
previously known concerning their closure properties. 

Another important motivation to look at reductions to partial information 
classes is that one can prove results of the following type: if ’P[D] contains, for 
certain families T>, languages which are NP-hard for certain polynomial time 
reductions, then P = NP. One of the best results m in this respect is that 
if P[3-SIZE2] contains a language which is NP-hard for n^“'^-tt reductions, then 
P = NP. 

We fully characterise the partial information classes ’P[D] which are closed 
under 2-tt reductions, bounded truth-table reductions and under Turing reduc- 
tions. It turns out that exactly the classes of n-cheatable languages are closed 
under any of these reductions. We also treat positive truth-table reductions and 
present a class P[T’] strictly containing P-sel which is closed under positive truth- 
table reductions. 

This paper is organised as follows. First we give some definitions and ba- 
sic facts concerning partial information classes. In Section 2 we briefly discuss 
closure under many-one and 1-tt reductions and give a simple combinatorial 
characterisation of the classes P[T’] closed under these reductions. In Section 3 
we show the main theorem which characterises the classes P[T>] that are closed 
under different truth-table and Turing reductions. In Section 4 we examine pos- 
itive truth-table reductions. Here again, we reduce the question of whether a 
class ’P[D] is closed under positive fc-tt reductions to finite combinatorics. 

1 Preliminaries 

Notations. Languages are subsets of S* = {0, 1}*. The join of two languages A 
and B is A(BB := { Ox | a: S ^}U{ lx \ x G B}. Let B := {0, 1}. For a language A 
the characteristic function XA- A"* — >■ B is defined by xa{x) = 1 iff x G A. We 
extend xa to tuples of words by xa(xi, ■ ■ ■ ,Xn) '■= xa(xi) ■ ■ ■ XA(xn)- In the 
following, elements of E* for which membership in languages is of interest are 
called words, elements of B* which are considered as possible values of charac- 
teristic functions are called bitstrings. For a bitstring b the number of I’s in b is 
denoted #i(6), b[i] is the i-th bit of b, and b[ii, . . . ,ik] '■= b[ii] ■ ■ ■ b[ik]. We extend 
this to sets of bitstrings by setting P[ii, ■ ■ ■ ,ik] ■= ■ ■ ■ Ak] \ b G P] . 

Partial information classes. In order to define partial information classes, we 
first need the notion of pools and families. 
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Definition 1 (n-Pools, n- Families). Let n > 1. A subset P C B" is called 
an n-pool. A set T> = {Pi, . . ■ ,Pr} of n-pools is called an n-family if 

1. V covers B", that is lJi=i 

2. T> is closed under subsets, that is P G T> and Q Q P implies Q G T>. 

Definition 2 (Polynomially X>- Verbose). For a given n- family T> a lan- 
guage A is in the partial information class 'P\T)] (respectively ldist[2^]/^ iff there 
is a polynomially time-bounded deterministic Turing machine that on input of 
n words (respectively distinct words) x\, . . . ,Xn outputs a pool Q G T> such that 
Xa{x\, ■ ■ ■ ,Xn) G Q. The languages in 'P\D] are called polynomially P-verbose. 

We present some definitions that make it easier to deal with polynomial 
2?-verboseness and then state some known facts which will be applied in the 
following. For more details on polynomial P-verboseness see m- 

Definition 3 (Operations on Bitstrings). 

1. Let Sn be the group of permutations of {1, • . . ,n}. For a G and b G B" 
we define a{b) := 6[cr(l)] ■ ■ - b [cr(n)]. 

2. For i G {1, . . . ,n} and c S B define projections Trf : B" — >■ B" by 7rf(6) := 

6[1] • • • — 1] c + 1] • • • 6[n] . 

3. For i,j G n} define a replacement operation pij : B" — ?> B" by 

Pi,j{b) := b' where b'[k] := b[k] for k j and b'[j] := b[i]. 

We extend these operations from bitstrings to pools by u>{P) '■= {t^(&) | b G P}. 
An n-family T> is said to be closed under permutations, projections and replace- 
ments if for all permutations, projections and replacements to and all P G T> we 
have Lo{P) G T>. 



Definition 4 (Normal Form). An n-family is in normal form if it is closed 
under permutations, projections and replacements. 



Fact 1 (Normal Form). For every n-family T> there is a unique n-family T>' 
in normal form with P[I?] = P[P']. 

Fact 2 (Class Inclusion Reduces to Family Inclusion). For all n-families 
T> and £ in normal form we have P[X>] C P[f] ijf'DQS. 



Fact 3 (Change of Tuple Length). Let T> be an m-family and m < n. Define 
the following n-family \D \ „ := [P Q B" | Vii < ■ ■ ■ < im '■ P[ii, ■ ■ ■ , im] G D}. 
Then P[I?] = P[|"2?]„] . Furthermore, ifD is in normal form so is 



Fact 4 (Intersection). For all n-families D and £ in normal form we have 

p[r>] np[£:] = p[r>n£’]. 
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Definition 5 (Generated n- Family). For n-pools Di, . . . , Dr the minimal n- 
family D in normal form for which {D\, . . . , Dr} is denoted by {D\, . . . , Dr). 
It is the closure of {Di, . . . , Dr} under subsets, permutations, projections and 
replacements. We say that D\,. . . , Dr generate {D\, . . . , Dr). 

Some n-families are of special interest as their partial information classes 
have been studied extensively in the literature. We write these special families 
in capital letters with the tuple length attached as index. 

Definition 6 (Special Families). 

1. Let SEL„ := ({0"-n* I 0 < i < n}). 

2. For 1 < k <2^ let fc-SiZE„ := {P C B" | |P| < fc}. 

3. For 1 < k < n + 1 let /c-CARD„ := {P C B" | |{#i(6) | 6 S P}| < fc}. 

4 . Let BOTTOM^ := ({6 I #i{b) < l}) and top„ := ({& | #i(b) > n - l}). 

The class P[seL 2 ] is exactly the class P-sel of p-selective languages, that is 
languages A which have a polynomial time selector function. Such a selector gets 
two words u and v as input and selects one of them. Provided u G A or v G A, 
the selected word must also lie in A. The class of p-selective languages has been 
extensively studied, starting with HOI. 

Fact 5 (SEL). P-sel = P[seL 2 ] = P[sel„] for n>2. 

Languages in P [(2" — 1)-SIZE„] are called n-approximable, n-membership 
comparable, non-n-p-superterse or n-p-verbose. The languages in P[n-SIZE„] are 
sometimes called n-cheatable in the literature, but especially in the older lit- 
erature this term is also used for the languages in P[2"-SIZE2^]. The following 
important fact is implicitly proven in 

Fact 6 (SIZE). P[PsiZEfc] = P[/c-SlZE„] for n> k. 

Languages in P[n-CARDn,] are called easily countable Hi. The languages in 
P[toPji] and P[bottom„] have no special names in the literature, but they will 
come up in different places in the following proofs. 



Reductions. In this paper all reductions under consideration will be polynomial 
time truth-table or Turing reductions defined in the standard way, see 1 1 iSj for 
detailed definitions. We write for many-one reductions, for truth-table 
reductions with k queries, for truth-table reductions with a constant number 
of queries, for truth-table reductions with a polynomial number of queries, 
and <!(. for Turing reductions. We write for positive truth-table reductions 

with k queries. In a positive reduction for each input word the boolean function 
that evaluates the answers to the queries is a monotone function. We write <ptt 
for positive truth-table reductions with a polynomial number of queries. 
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2 Many-One and 1-tt Reductions 

We review what is known uni about closure under polynomial time many-one 
and 1-tt reductions for classes P[P] and Hdist[2^]- 

Theorem 1 (Many-One Reductions). Let V he an n-family. Then P[2?] is 
closed under <^-reductions. But Hdist[2^] is not closed under <^-reductions, 
unless ldist[2^] = 'P[D'] for some n-family T>' . 

Not all P[T>] are closed under 1-tt reductions. To characterize the families T> 
for which this closure property holds, a new type of operation on bitstrings and 
pools is needed: 

Definition 7 (Bitfiip). For n > 1 and i G {1, . . . , n} define the bitfiip oper- 
ation flipj : B" — >• B" by flipi(6) := 6[1] • • • — 1] (l — b[i])b[i -|- 1] • • • b[n\. This 

operation is extended to pools and families of pools in the obvious way. An n- 
family T> is called closed under bitfiip iffiip^(fD) = T> for all 1 < i < n. 

Theorem 2 (1-tt Reductions). For all n-families V in normal form, ^[D] is 
closed under <^_^^-reductions iff T> is closed under bitfiip. 

If an n-family T> is closed under bitfiip then P[H] is also closed under com- 
plement. The converse does not hold as the family T> = SEL 2 shows. 

3 Prom 2-tt to Turing Reductions 

As we know from pirnnj the closure of P-sel under polynomial time Turing 
reductions equals P/poly. But all P[P] are proper subclasses of P/poly. Therefore 
we have: 

Fact 7. For SEL„ QT> 2-"size„ the class F\D\ is not closed under polynomial 
time Turing reductions. 

In 13 Theorem 2.9] Beigel, Kummer and Stephan construct languages A 
and B such that A B G IdistlTOPa] and A ^ P[(2"- — 1)-SIZE„] for all n. In 
terms of closure under reductions this yields: 

Fact 8. Let T> 2^-SiZEn be in normal form. Lf top„ C T> or bottom„ C T>, 
then P[H] is not closed under -reductions. 

On the other hand Amir, Beigel and Gasarch PC! and Goldsmith, Joseph 
and Young prcni showed that cheatability classes are closed under polynomial 
time Turing reductions. 

Fact 9. For all n > 1 the classes P[n-SIZE„] are closed under -reductions. 

In the following we show that this cannot be extended to other classes and 
even that the cheatability classes are the only nontrivial classes in our context 
which are closed under 2-tt reductions. 

To deal with k-tt reductions, we introduce for each language A a language 
Ak-tt which is many-one complete for the fc-tt reduction closure of A. 
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Definition 8. For a language A and k > 1 define 

Ak-tt ■= {{xi, . . . ,Xk,(f) I 0: B'" -)>B, (j){xAixi,...,Xk)) = l} ■ 

This definition is inspired by the study of btt-cylinders fmw\ . A language is a 
k-tt cylinder iff Ak-tt <m 

Lemma 1. For all languages A and B we have B A iff B Ak-tt- 

Proof. Suppose B A via M . If on input x the machine M computes queries 
qi,. . . ,Qk and a boolean function such that fxifKAiQij ■ • • j <?fc)) = Xb{x), then 
X G B iff (gi, . . . , qk, 4 >x) G Ak-tt- Thus B <p Ak-tt- 

Suppose B <Pj Ak-tt via M. If on input x the reduction machine M computes 
{qi, . . . ,qk, 4 >x), then asking the oracle A with queries qi, . . . ,qk and evaluating 
the answers with fx constitutes the fc-tt reduction to A. □ 

As an immediate corollary we obtain that a language class C closed under poly- 
nomial time many-one reductions is also closed under k-tt reductions, iff for 
every A G C we also have Ak-tt G C. 

In order to prove the main results of this section we first show a rather 
basic fact on bounded truth-table reductions. Although this fact should be well 
known, we could not find it in the literature. It states that polynomial time fc-tt 
reductions can be replaced by sequences of 2-tt reductions. 

Lemma 2. Let A and B he languages with A B. Then there exist inter- 
mediate languages Ci, . . . ,Cr such that A <2_tt ’ ’ ‘ -2-tt <2-tt 

Proof. Let = {bi , . . . , 62'=} where the hi are in lexicographic order. The chain 
of 2-tt reductions from A to B will consist of the following two subchains 

A <2-tt — Ltt ^b 2 ^2-tt ■ ■ ■ — Ltt 

-2-tt ^2-tt Hk-l <2-tt ■ ■ ■ — Ltt -^1 — Ltt 

The languages for z G {1 , . . . , fc} are defined as follows: 

H-, := {(xi,...,Xi,b) I xs(xi, . . . ,Xi) = 6 g B*}. 

Clearly, Ffi <i_tt B and <2-tt Thus, the second part of the chain from 
A to S is correct. 

The first part consists of the following languages Di, with 6 G B^ : 

Db := (^Bk-tt n {(xi, . ..,Xk,(j)) I Xb{xi,. ..,Xk) > lex &}) ®Hk. 

Note that = Bk-tt © Hk and hence by Lemma [H we have A <i_tt Db^. Note 
furthermore that we also have F>b^k <i_tt Hk. 

To show Db^ ^2-tt T*&i+i> ^ be an input word, li x = ly then we must 

decide whether y G Ftk which can trivially be done by passing on ly as a query to 
Dbi+i . If X = 0 (xi, . . . , Xfc, we ask two queries: we ask qi := 0 (xi , . . . ,Xk, 4 i) 
and (72 := 1 (xi, . . . , x^, bi). If the answer to the second query is “yes” we know 
Xb{xi, - . - ,Xk) = bi and output (fibi), ignoring the answer to the first query. If 
the answer to the second query is “no” we output the answer to the first query 
which is, indeed, correct. □ 
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Corollary 1. A language class C is closed under <^^^^-reductions iff C is closed 
under < 2 _^^-reductions. 

For the formulation of our Main Theorem^, we need the following definition: 

Definition 9 (fc-Cone). For n,k > 1 and an n-family T> in normal form an 
nk-pool P is a fc-cone for I) if for all tuples {(f>i, . . . , 4>n) of functions (f>i : — >■ B, 

the set of hitstrings 

{4>i{b[l, ...,k]) 4>2{b[k + 1, . . .,2k]) ■ • - l)fc + 1, . . .,nk]) \ bG P} 

is a pool ofT>. The nk-family of all k-cones for T> is denoted by k-cones{T>) . 

Theorem 3. For an n-family T> in normal form and k > 1, P[I?] is closed under 
polynomial time k-tt reductions iff \T>]nk Q k-cones (fD) . 

Proof. Suppose \T>]nk C fc-cones(P). Then every pool D S \V]nk is a fc-cone 
for V. To prove that P\D] is closed under <^_(.(.-reductions, by Lemma Gland 
Theorem [0 it suffices to show that for every A &'P\T)] we have Ak-tt SP[P]. 

Let A € P[[2i’]rtfc] via M. Then S V\D] is witnessed by the following 
algorithm. On input x\, . . . ,Xn test whether the Xi are in syntactically correct 
form, that is test whether Xi = (^y\, . . . ,y\, (pf) for some y) and (pi. If not, replace 
Xi by an x' of correct syntax. If we have found a pool D for this input of 
correct syntax, we find a pool for the original input by projecting D in the i-th 
component to 0, using that T> is closed under projections. So suppose the Xi 
are all of the form Xi = {y\, . . . ,yl,(pi). Let M compute a pool D' G \V]nk for 
y\, ■ ■ ■ ,yhyl, ■ ■ ■ ,yl^ ■ ■ ■ ,y\ , ■ ■ ■ ,Vk - Because D' is a fc-cone for V, the pool 

D:={cPffb\,...,bl)(P2{hl,...,bl)---(Pffb^,...,bl)\b\---hl€D') 

is a pool in T> and by definition of Ak-tt we have {x\, . . . , Xn) G D. 

For the opposite direction, suppose that P[T>] is closed under ,.^-reductions. 
We will show \V]nk C k-cones(V) by showing P[[I?]„fc] C P [A:-cones(2?)] and 
using Fact|^ Consider a language A & ¥ \\V]nk\. We will exhibit an algorithm 
Ma which witnesses A G P [/c-cones(P)] . Because A G P[|’2?]„fc], we have A G 
P[2?]; and because P[2?] is closed under k-tt reductions by Lemma G1 there is a 
machine M which witnesses Ak-tt G P[2^]- 

In order to keep the rest of the proof more readable, we introduce some 
ad-hoc definitions and abbreviations: 

^ := { (</)!,..., I 0^: B for 1 < z < n}, 

(j) := (^ 1 , . . . , (j)n) J 

X . Xp . , . . . , , . . . , Xp, ) , 

Furthermore, for every input tuple x € and every (j) G ^ we write 

X O (j) . ((^1; ■ ■ • (^1? • ■ • ^ 2 ) 5 ■ ■ ■ 5 (^1 5 • ■ • ) ’ 
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For a bitstring 6 G a pool Z) C B”^ and some cj) G <I> we write 
^{D) :={m\bGD}. 

Finally, for a pool D C B" and (j> G we write := {6 | 4>{b) G V}. 

We describe how algorithm M^i on input x computes a pool for x: 

For all (j) G (P: 

Compute M{xo (j)) =: G V. (We then know ° 4’) ^ D^.) 

Then compute (It holds xa{x) G (^“^(D^).) 

Now compute 

D. := n 4~\D^). 

Then xa(x) G D^, and G fc-cones(P) as 4>{Dx) Q G T> for all ij) G (P. □ 

Lemma 3. Let T> be an n-family in normal form. IfP[T>] is closed under poly- 
nomial time n-tt reductions, then T> = m-SiZEji for some m G N. 

Proof. Suppose P\D] is closed under n-tt reductions, where n is a fixed constant. 
Let m be minimal with V C m-SiZE„. Let D = {bi, . . . ,bm} G T> be a pool of 
maximal size. We have to show that for every pool Z1 C B" with |if| — m 
already E gV. Suppose E = {ei, . . . , e^}. By Fact Olfor change of tuple-length 
we know that the pool D' := { (6^)" | bi G Ej, where each bitstring consists of n 
copies of an original bitstring from E, is in Now define boolean functions 

4>i,...,4>n such that 4>i{bj) = ej[i]. Then the image of E' under . . . , (pn) 
is E. Because E' is an n-cone for V, it follows that E gT>. □ 

Lemma 4. Let E be an n-family in normal form with SELji C E such that P\E\ 
is closed under polynomial time 2"-tt reductions. Then E — 2"-SIZE„. 

Proof. Because P\E] is closed under 2"-tt reductions, by Theorem 0 we know 
that [P]n 2 " C 2"-cones(P). It suffices to exhibit a pool E G and 2^-ary 

boolean functions pi, . . . ,pn such that the image of E under {pi, . . . , pn) is B". 
Note that because |"sel„]„ 2 " = sel„ 2 " by Fact0 we have sel„ 2 " C [P]„ 2 "- We 
choose E C B"^” as a pool of size 2”: 

E := {&!, . . . , 62"} with := 

This means that each bitstring bi consists of the concatenation of n copies of a 
bitstring of length 2". Because the bitstrings in E form an ascending chain (see 
Definition El E G SEL„ 2 " C [P]„ 2 "- Let B" = {ci, . . . ,C2"}. Now define boolean 
functions pj such that pj{0'^ = Ci[j]. For those b which are not of the form 

q 2 choose some arbitrary value for pj{b). The choice of pj ensures 

{pi,...,pn){bi) = = Ci[l] = Ci. 



This yields {pi,.. .,pn){E) = B". 



□ 
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We sum up the preceding results in the following theorem which characterises 
the classes closed under 2-tt reductions as well as under Turing reductions: 

Theorem 4 (Main Theorem). For n-families V in normal form with V yf 
2”-SIZE„ the following are equivalent: 

1. T> = m-SiZE„ for some m < n. 

2. ~P\D] is closed under polynomial time 2-tt reductions. 

3. P[T>] is closed under polynomial time Turing reductions. 

Proof. The class P[m-SIZE„] with m <n\s equal to P[TO-SiZEm] by Fact El The 
class P[m-SiZErn] is closed under polynomial time Turing reductions by FactEl 
A class closed under Turing reductions is also closed under 2-tt reductions. 

If for an n-family T> in normal form P[T’] is closed under 2-tt reductions, then 
by Corollary [0 it is also closed under polynomial time n-tt and 2"-tt reductions. 
By Lemma 13 P — m-SiZE„ for some m. By assumption m 2". If to > n we 
would have SEL„ C V, which is impossible by LemmaE Therefore T> = to-SIZE„ 
for some m < n. □ 



4 Positive Reductions 

A motivation to investigate positive reductions is the fact that NP is closed un- 
der polynomial time positive Turing reductions. In general, when dealing with 
classes which are not (known to be) closed under 1-tt reductions it suggests itself 
to look for closure under some kind of positive reduction. Regarding reductions 
to languages in partial information classes it can happen that a reduction type, 
although more powerful than some other in general, looses its extra power when 
the oracle is taken from P[T’] for certain T>. For example it can be shown that 
querying an arbitrary number of queries to a language B G P[fc-SiZEfc] can always 
be replaced by putting only fc — 1 of these queries to oracle B. Regarding posi- 
tive reductions, Selman m showed that a polynomial time positive truth-table 
reduction to a p-selective language can always be replaced by a many-one reduc- 
tion. It follows that P-sel = P[seL 2] is closed under ptt-reductions. We extend 
this result to SEL2 U 2-SIZE2, a family in normal form strictly above SEL2. The 
partial information class P[seL 2 U 2-SIZE2] is exactly the class of languages L, for 
which L and its complement L are strictly 2-membership comparable, see H2I. 

Theorem 5. If A B and B € P[seL2 U 2-SIZE2] then A <p, B. 

Proof. First, note that SEL2 U 2-SIZE2 = SEL2 U {01, 10}. In the following we will 
call this latter pool the xor-pool. 

Let A B via a reduction R and let B € P[seL 2 U 2-SIZE2] via a ma- 
chine AI. To show A <Pj B, upon input x we must compute a query q such that 
Xa(x) = XB(q)- Let qi, . . . ,qk be the polynomially many queries produced by R 
upon input x and let (j) be the monotone evaluation function used by R. Note 
that for once k depends on the length of the input x. 

We define a graph G = (R, E) with coloured edges as follows. The vertices 
V = {qi, . . . ,qk} are exactly the queries qi. For each pair {qi, qj) with qi <iex qj 
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the machine M outputs (possibly a subset of) one of the three pools {00, 01, 11}, 
{00, 10, 11} and the xor-pool {01, 10}. In the first case there is a black directed 
edge from to qj, in the second case there is a black directed edge from qj to 
If the xor-pool is output, qi and qj are connected by an undirected red edge. 

This graph has the following two properties: First, if qi G B then for every 
black edge going from qi to a vertex qj we also have qj G B. Second, if qi and qj 
are connected by a red edge, then qi G B iff qj ^ B. 

We compute a single query q — qi such that knowing xb(<i) yields XB^Qj) 
for all other vertices. To do so, we apply the following pruning algorithm to the 
graph: 

Search for a red edge (g^, qj) plus another node qg which is connected to both 
qi and qj by black edges. If both black edges go from qi to g^ and qj we know 
Xb{<u) = 0, for being connected by a red edge exactly one of the two words qi 
and qj must be in B. Thus we can remove qi from the graph. Likewise, if both 
edges go from qi and qj to qi we know xb{<u) = 1 and we can also remove qi. 
Now, if the first black edge goes from qi to qi and the second from qi to qj , the 
graph property yields XB(qi) = 1 and XB{q_j) = 0. Conversely, if the black edges 
go the other way round, we know XB{<li) = 0 and XB{<lj) = 1- In either case we 
can remove both qi and qj. 

For the remaining graph G' = {E' , V') there are two possible situations: 
either the graph no longer contains a red edge or it contains a red spanning tree. 

If the graph contains no red edge, it is a tournament; that is for all vertices in 
the graph the machine R behaves like a selector. Using this selector we can com- 
pute a pool from SEL|^/| for the characteristic string of the vertices in the graph. 
To compute the special query g, we proceed as follows: for each bitstring b G P, 
in order of increasing number of I’s, compute (j){b). If cj){b) = 0 (or (j){b) = 1) for 
all 6, we do not have to query the oracle at all. Otherwise there is exactly one 
bitstring such that (f>{b) = 0, but (j){b') = 1 for the next bitstring. We take the 
word at the position where a 0 in 6 changes into a 1 in 6' as our query g. 

If the graph has a red spanning tree, knowing the characteristic value of 
any vertex in the graph immediately yields the characteristic values of all 
other vertices. Hence, we can compute two bitstrings bo and b\ such that 
Xs(gi, . . . , gfe) G {bo,bi}. Compute ^(&o) and <p{bi). If these are equal, we do 
not need to query B at all. If they are different, there must exists a position i 
such that &o[*] = 4’(bo) and bi[i] = 4>{bi) and we can ask g := g^. □ 

Corollary 2. Let T> be a family with f\D\ C P[seL 2 U 2 -SIZE 2 ]. Then is 
closed under polynomial time positive truth-table reductions. 

Proof. If A <ptt B G then A <p, B by Theorem 0 But P[T>] is closed 

under many-one reductions by Theorem 0 Therefore A Gf’\T)]. □ 

We now exhibit some families T> for which P \D] is not even closed under 2-ptt 
reductions. To do this, we proceed similarly as in the previous section where we 
showed that certain f’\D] are not closed under fc-tt reductions. For every A we 
introduce a typical language Ak-ptt which is fc-ptt reducible to A. We then define 
positive fc-cones and with these we characterize the families T\D] closed under 
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fc-ptt reductions. Finally we present some special families that do not comply 
with this characterization. 

Definition 10. For a language A and k > 1 define 

Ak-ptt ■= {(xi,. . . ,Xk,(l)) I (/i: l'" B js monotone, 4 >{xa{xi, . . . ,Xk)) = l}. 

The following lemma is the analogue of Lemma Q for positive reductions. A 
proof is almost identical to the proof of that lemma and is therefore omitted. 

Lemma 5. For all languages A and B we have B 'Fi'k-ptt ^ff ^ ^fc-ptt- 

Definition 11 (Positive fc-Cone). For k > 1 and an n-family V in normal 
form, an nk-pool P is a positive fc-cone for T> if for all tuples {4>i, . . . ,4>n) of 
k-ary monotone boolean functions (pi the set of bitstrings 

{(pi{b[l, ■..,k]) p 2 {b[k + 1, . . .,2k]) ■ ■ ■ (pn{b[{n - l)fc + 1, . . .,nk]) \ bG P} 

is a pool of T> . By fc-pcones(21) we denote the nk-family of all positive k-cones 
for T>. 

Theorem 6. Let T> be an n-family. Then P[15] is closed under <ff._^^^-reduction 
iff \T^]nk Q fc-pcones(P). 

The omitted proof is essentially the same as for Theorem 0 for fc-tt reductions. 

Theorem 7. Let V be a 2-family with BOTTOM 2 C T>. Lf P[T>] is closed under 
2-ptt reductions, then V = 4-SIZE2. 

Proof. The pool {00, 01, 10} is in V and therefore with the definition from Fact 0 
for change of tuple-length, it is easy to check that D — {0000, 0010, 0100, 1001} 
is in [H] 4 . Choose (pi = p 2 SiS the 2-ary boolean or-function. Then we have 
(pi,p 2 )(D) = {00,01,10,11} = B^. Because V is closed under 2-ptt reductions 
(^ 1 , p 2 ){D) has to be in T>. It follows that T> = 4-SIZE2. □ 

If a class C is closed under 2-ptt reductions then co-C also is closed under 
2-ptt reductions. Therefore Theorem 0 also holds if TOP 2 C T>. 

We have now characterised the families T> for tuple-length n = 2 for which 
P[T>] is closed under positive truth-table reductions: 

Theorem 8. For 2-families T> in normal form with T> yf 4-SIZE2 the following 
are equivalent: 

1. P\T>] is closed under polynomial time ptt reductions. 

2. P[D] is closed under polynomial time 2-ptt reductions. 

3. V C SEL2 U 2-SIZE2. 

4 . BOTTOM 2 2 ^ o-iT'd TOP 2 2 

While in the previous section on general fc-tt and Turing reductions a com- 
plete picture was presented, in the case of positive reductions there remains work 
to be done. The above result should be extended to general tuple-lengths n and 
to positive Turing reductions. 
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Abstract. We show that nnder some truth-table oracle B there are 
almost exponential gaps in the (infinite often) hierarchy of bounded- 
error probabilistic time, in particnlar BPP^ = ZPTI M E^{n). This 
proves a main theorem in [5], which is extended to the theoretical limit. 



1 Introduction 

Separation and hierarchy theorems are among the earliest and most basic results 
in structural complexity theory. Without dense hierarchies it makes not much 
sense to look for lower bounds. After discovering good separation results for 
nondeterministic complexities, the topic seemed to be finished up to some small 
open problems (e.g. the logarithmic gap in the deterministic time hierarchy). 

The situation changed when probabilistic algorithms gained in importance. 
The problem of separating complexity classes of probabilistic machines with 
small error probability (Monte-Carlo machines) came to our attention, when we 
unsuccessfully tried to derive separations for Monte-Carlo space classes from the 
existence of very slowly increasing Monte-Carlo space constructable bounds [8], 
[9] (cf. also [4]). Then Fortnow and Sipser [5] claimed the existence of an oracle, 
under which Monte-Carlo polynomial time collapses to linear time. This had been 
the first example of a relativized collapse of a natural complexity hierarchy. As 
known since several years this proof is not complete (cf. Berg and Hastad [3] and 

[6] and a lot of e-mail discussions between L. Fortnow and the second author 
since September 1989). For an overview of the problem cf. also Hemaspaandra 

[ 7 ] . 

There is a small number of other oracle constructions, which yield hierarchy 
gaps under oracles. All of them are related to nondeterministic time: Rackoff and 
Seiferas [11] show that the small gaps in the time hierarchy (i.e. the requirement, 
that t{n + 1) G o(T(n)), cf. [12]) cannot be closed using relativizable techniques 
and Allender, Beigel, Hertrampf and Homer [1] present an oracle, under which 
there are almost exponential gaps in the almost-everywhere hierarchy. 

In this paper we present a rigorous proof for the result claimed by Fort- 
now and Sipser for truth-table oracle machines, that is we give the complete 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 311- 1!^ 2001. 

© Springer- Verlag Berlin Heidelberg 2001 
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construction for an oracle B, under which ZPTIME^{n) = BPP^. Further- 
more we extend the gap to the theoretical limit, which is given by relativizable 
separation techniques (see [1]). In particular there exist oracles C so that 

ZPTIMEg{n) = BPTIMEg{f), if S 0{2^). 

We will divide the proof of this result into two parts. In a first part (Sections 2.1 
- 2.4) we attack the main combinatorical problems in our proof of the relativized 
collapse. Here we consider a special coloring problem of graphs. Based on these 
results we will give a proof of a relativized collapse in Section 3. In Section 4 
we will add some conclusions and discuss open problems connected with the 
presented results. 

Our main result can also be proven for general (not only truth-table) oracle 
machines and for some other complexity classes [10], but the proof is even more 
involved than in this case. Earlier doubts in the completeness of the proof caused 
us 

— to restrict this paper to the truth-table case and 

— to include a detailed proof rather than a broad discussion of the general case. 

We assume that the reader is familiar with the fundamental notions of stan- 
dard Turing machine models and asymptotic analysis (cf. [2]). So we won’t give 
detailed exposition of the definitions but rather a short setting of the notations 
we will use throughout. We will give some mathematical notations next. Machine 
related notations, especially definitions of Monte-Carlo, Las- Vegas machines and 
truth-table oracles, will be given in Section 0 

Let S := {0, 1}. By E* we denote the free monoid over E. For words w G E* 
we denote the length of w by licj. The set of natural numbers is denoted by IN. 
log will denote the logarithm to base 2 throughout this paper. 

Let two sets M and M' be given. The cardinality of M is denoted by {(M, the 
set of subsets of M is denoted by 2^. We will denote partial (total) mappings 
by H :C M — >■ M' {A : M ^ M'). In the latter case we write also A € 

Given a subset N C M and a relation A C AI x M' we define A{N) by A{N) = 
{m € M' I 3n G N.{n,m) G A}. 

Given partial mappings A,B,C:CM^ M' we denote the domain of A by 
dom{A) and the restriction of H to a subset N of M by A\isf. Furthermore we 
extend the inclusion, union and difference of sets to mappings by 

Ac B dom{A) C dom{B) A B\aom(A) = 

Ac B = C dom{C) = dom{A) U dom{B) AACCABCC and 

[A \ B] = C dom{C) = dom{A) \ dom{B) AC C A. 

2 Graph Colorings 

2.1 A Special Kind of Graph 

We will use rather frequently directed graphs together with ’evaluation’ map- 
pings, which will always fulfill common restrictions. To simplify matters, we will 
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thus denote these graphs and mappings by G = (V, E) and cr, respectively, where 
we will assume the following restrictions on G and cr throughout this paper. 
The set V of vertices will always be assumed to be determined by 

(Gl) V = {{i,x,u)\i G IN,x G {0,1}*, u G {0,1}*, |x| > and 

|u| = 3 - |x|}. 

Furthermore we will assume that the set E QV xV oi edges fulfills 

(G2) V(* ,x,u), (i, X, v) G V. E{i, X, u) = E{i, x, v) and 

(G3) \/{i,x,u),{j,y,v) GV. {j,y,v)GE{i,x,u)^\y\<\x\‘^'^\ 

Condition (G2) above forces edges of G to depend on pairs {i, x) rather than on 
triples (i,x,u). Let cr be a mapping a :C V x {0,1,?}^ — >■ {0,1, e} so that for 
all (i,x,u), (i,x,u') in V and all c, c' G {0,1,?}^ condition (G4) hold 

(G4) if c{j,y,v) = c'{j,y,v) for all {j,y,v) G E{i,x,u) = E{i,x,u') then 
a((i,x,u),c) = a((i, X, u), c') = a((i, x, u'), c). 

In particular, also cr does only depend on pairs (i,x) (rather than on triples 
(i, X, u)). a will be interpreted as a kind of ’evaluation’ mapping: Given the values 
of all neighbors (determined by some c) of a pair (z, x), a evaluates to ’accepting’ 
{a{{i,x,u),c) = 1) or ’rejecting’ (a((i,x,u),c) = 0) or ’error’ (a((i,x,u),c) = e). 
It is not hard to guess that cr will be given by the oracle-machines later on. 

Given (G, a) we say that c G {0, 1, ?}^ is a coloring of (z, x,u) G V in (G, cr), 
iff either 

a((i,x,u),c) yf 0, \{(i,x,u') G V\c{i,x,u') = 1}| > and 

{(z,x,zz') G V\c{i,x,u') = 0} = 0 

a{{i,x^u),c) yf 1, \{(i,x,u') G V\c{i,x,u') = 0}| > and 

{(z,x,zz') G V\c{i,x,u') = 1} = 0. 

We call c G {0,1,?}^ a coloring of (G,ct) iff for all z G IN either there exist 
(z, y,v) G V fulfilling cr((z, z/, u), c) = e or c is a coloring of almost all (z, x,u) G V 
in (G, cr), i.e. there exist G IN so that c is a coloring for all (i,x,u) with 

|x| > Tli. 



2.2 A Simple Game 

In this section we will assume that for all {i,x,u) G V there exist small neigh- 
borhoods G of {i,x,u) in G totally determining cr. We will call this small set of 
vertices ’set of critical strings’ (of (z,x) in G). We will prove that in this case 
(that is, sets of critical strings exist), there always exists a coloring of (G, cr). 
(This is a central step towards a relativized collapse because, as we will see later 
on, suitable truth-table oracles can be easily defined by colorings.) Although the 
assumption of the existence of critical sets is to strong to prove a relativized col- 
lapse of the polynomial time hierarchy, it will serve as a base for and motivation 
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of the more complex definitions and methods in the next sections. Notice that 
our proof of a relativized collapse in Section |3 does not depend on any results 
given in this Section 12.21 

To simplify matters let Bn for n £ IN denote the set of all mappings c £ 
{0,1,?}^ so that c is a coloring of (j,y,v) £ V for all j < n. Given (G,cr) 
and a triple {i,x,u) £ V let us call a set C C E{i,x,u) critical for (i,x,u), iff 
a{{i,x,u),c) = a{{i,x,u),c') for all c,c' £ Bi with c\c = c'\c- (We will call a 
mapping C \ V ^ 2^ a, critical mapping in {G,V) iff C{i,x,u) is a critical set 
of {i,x,u) for all {i,x,u) £ V .) 



Lemma 1. Let (G, tr), G 
exists a critical set C(i ^ „) 

o/(G,a). 



= (V, E) be given so that for each {i,x,u) £ V there 
of size at most . Then there exist colorings 



Proof. As we have already mentioned a{{i,x,u),c) does not depend on u. Thus 
we can choose G(i .c,«) so that for all {i,x,u),{i,x,u') £ V. 

We will construct a sequence cq C ci C . . . of mappings Ci :C V ^ {0, 1, ?} so 
that c = UiGiN G is a coloring of (G, a). For given c :C R — >• {0, 1, ?} and n £ IN 
let Bn{c) := {c' £ | c' A c}. Let cq be determined by 

co(i,x,u) =? iff (i,x,u) £ dom(co) 

iff there exist (j,y,v) £ so that |y| < 2 • |a:| and 
(t, X^ Zi) £ 

for all (i, X, u) £ V. Given c„ let Cn+i be determined by 



and 



dom{c„+i) = {{i,x,u) € V \ |a;| < n + 1} U dom{cn) 



Cn+i{j,y,v) 



1 : 3h & Bj{cn).(j{{j,y,v),b) = I 

0 : otherwise 



for all (j,y,v) £ dom{cn+i) \ dom{cn). 

Let c = UiGiN G- We will now prove c £ Bn{ci) for all £ IN (and thus 
prove the lemma). As c' £ Bo{ci) for all i £ IN and all c' £ {0, 1, 1}^ with c' A a 
we have c £ Bo^Ci). Assuming c ^ Bn+i{ci) but c £ Bn{ci) for some n,z £ IN, 
there have to exist a triple (n, x,u) G V so that c is not a coloring of (n, x, u) 
in (G,(t). In the definition of C| 3 ,|_|_i above we fix either c\^\.^i(rL,x,v) = 1 for all 
(n, x,v) G V \ dom{c\x\) or C| 2 ;|+i(n, cc, w) = 0 for all (n, x,v) G V \ dom{c\x\). As 
(n, x,v) G V \ dom{c\x\) (n, x,v) G V \ dom(co) we have C|a,|_|_i (n, x,v) =? for 
all vertices {n,x,v) G dom(cjxj). Moreover there are at most 

ji{(n,x,v) £ dom(co)} < log^^i(2|a;|) • 2\x\ ■ 2^l“l • 

^ 23log{\x\) . 22|a:| . (2.1) 

< f25l“l • 2^l^l • 2!l'^l = 



vertices fixed in ( |a;| > 2^°° by definition!), where log^^^ denotes log log log. 
Thus we may distinguish the following cases: 
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Case 1 ^{{n,x,v) G V \ c(n,x,v) = 1} > and 

{{n,x,v) G V I c{n,x,v) = 0} = 0. 

Then by construction there exist b G Bn{c\x\) so that a{{n,x,u),b) = 1. As 
C(n,x,u) C dom{c\j;\) and c G we have cr((n, x, u),b) = cr((n, x, u),c) = 1. 

(Notice: C(^n,x,u) is a critical set of (n,x,u)!) That means, c is a coloring of 
(n, X, u) in contradiction to the assumption. 

Case 2 ’i{{n,x,v) G V \ c(n,x,v) = 0} > and 

{{n,x,v) G V I c{n,x,v) = 1} = 0. 

In this case we have cr((n, x, u), &) yf 1 for all b G Bn{c\^\)- Thus (as C(^n,x,u) Q 
doTO(c| 3 ,|) and c G Bn{c\^\)) we have a{(n,x,u),c) yf 1. Again c is a coloring of 
(n, X, u) in contradiction to our assumption. 

□ 



2.3 Coloring with Exceptions 

As we have already mentioned, the conditions introduced in the previous section 
are too strong to prove a relativized collapse; Even for truth-table machines 
we cannot guarantee the existence of critical mappings. In this section we will 
therefore generalize the ideas of the proof of Lemma Q] especially the concept of 
critical mappings C. 

Let (G,a), G = (V,E) be given so that (G1)-(G4) are fulfilled. Until other- 
wise stated let C denote a mapping C :C U — >• 2^ so that 

(Cl) C(* ,x,u) C E(i, X, u), 

(C2) G{i ,x,u) = G(i, X, v) and 

(C3) t|C(i,x,u) < 

for all (i,x,u),{i,x,v) G V. Notice that we have not restricted C to determine 
a as we have done in the definition of critical mappings. We will add a similar 
condition in (G4) later on. Similar to the proof of Lemma ^ let Cq denote the 
mapping Cq :C V —>■ {0,1,?} such that 

CQ{i,x,u) =? iff {i,x,u) G dom{cQ) 

iff there exist {j,y,v) G U so that |y| < 2 • |a;| and 
(i,x,u) G G{j,y,v) 

for all (i,x,u) G V. In the proof of Lemma ^ any {i,x,u) G belongs 

to dom{cn) as long as |j/| < n and furthermore, for any such (j,y,v), there 
exist b G Bj{cn) so that 6 is a coloring of {j,y,v). These are the only features 
of c„ we have used to prove that c is indeed a coloring (beside the fact that 
c = UneiN total). Let us therefore generalize Bn first. This time we will 
take into account that, given an index i G IN, a coloring c of (G, a) have to be 
a coloring of almost all (i,x,u) G V only and might be not a coloring of any 
{i,x,u) if there exist y and v so that a{{i,y,v),c) = e. Let Z G (IN U {oo})®^, 
n G IN and & :C U — >• {0, 1,?} be given. By Bn{b,l) we denote the set of all 
b' G {0,1,?}'^, b' A Cq so that 
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(Bl) b'Db, [6'\c^](R)C{0,l}, 

(B2) if l{i) G IN then b' is a coloring of all (i,x,u) G V with |a;| > l{i) and 
(B3) if l{i) = oo then there exist G R so that a{{i,x,u),b') = e, 

for all i < n. Now we will generalize the set {c„ | n G IN}: Given I as above let 
A^{1) denote the set of all 6 :C R — >• {0, 1, ?}, 6 3 so that 

(Al) for all (i,x,u) G dom{b) \ dom{cQ) 

(Al.l) b{i,x,u)^7, 

(A1.2) (j,y,v) G dom{b) for all (j,y,v) G C{i,x,u) and 
(A1.3) if l{i) G IN and l{i) < \x\ then there exist colorings c G Bf{b,l) of 
(i, X, u) in (G, a), 

(A2) there exist m G IN such that {i,x,u) G dom{cQ) for all {i,x,u) G dom{b) 
with \x\ > m and 

(A3) for all i G IN with l{i) = oo there exist {i,x,u) G V and b' G {0, 1,?}^, 
b' A b so that a{{i,x,u),b') = e and (j,y,v) G dom{b) for all (j,y,v) G 
E{i, X, u). 

Notice that for b G A^{1) and i G IN with l{i) = oo there exists (i,x,u) G R so 
that a{{i,x,u),b') = e for all b' G {0,1,?}^ with b' A b. The following results 
can be easily verified. 

Lemma 2. Let Iq be given by lo{i) = 0 for all i G IN. Then Cq G A^{Iq). 



Lemma 3. Let n, m G IN, I G (INUloo})®^, b G A^{1) and c G B^{b, 1) be given 
so that {i, X, u) G dom(c^) or |x| < m for all {i, x, u) G dom{b). Then b' G A'^ {I') 
where 

,,,. , I c(i,x,u) : |x| < m 

b {i,x,u) = < r/- \ fi 

( Cq (z, X, u) : otherwise 



for all (i,x,u) G V and 




for all z G IN. 



i < n 
otherwise 



Now we are prepared to give the definition of weakly critical mappings: We call 
G : y — >■ 2^^ a weakly critical mapping of z G IN in (G, ct) iff (C1),(C2) and (C3) 
above are fulfilled and the following condition (C4) hold 

(C4) for all (i,x,u) G V, all / G (IN U {oo})'^ with l{i) yf oo and all b G A^{1) 
either there exist c G Bf{b,l) and {j,y,v) G V with j < i, l{j) yf oo and 
ct((j, y, v),c) = e or cr((z, x, u),c') = cr((i, x, u),c") yf e for all c', c" G Bf(b, 1) 
with C \c(i,x^u) ~ C \c(i,x,u)- 

We call C a weakly critical mapping in (G, cr) iff C is weakly critical for all z G IN 
in (G, cr). 

Lemma 4. Lf there exist weakly critical mappings in (G, a) then there also exist 
colorings of (G, a) . 
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Before we prove Lemma 0 we will state a result which will be proven in the next 
section (see Lemma EJ. 

Lemma 5. Let i G IN, C be a weakly critical mapping of all j < i in (G, a) and 
I G (INU{oo})'’^ be given. Then for all b G A^{1) so that Bf {b,l) = 0 there exist 
j < i, {j,x,u) G V and c G Bj^{b,l) so that l{j) G IN and a{{j,x,u),c) = e. 

Proof, (of Lemma 0) Let G be a weakly critical mapping in (G, u). We will 
determine sequences Cq C ci C ... and Iq, li, ... with G (IN U {oo})'^ and 
c„ G A^(l„) so that for all n G IN 

(i) Imin) = ln+i{n) for all m > n, 

(ii) (j,x,u) G dom{cn) for all {j,x,u) G V with \x\ < n and 

(iii) cr((j, X, u), b) ^ e for all (j, x, u) G Fand all b G B^ {cn, In) with j < n and 
ln{j) G IN. 

It is easy to verify that c = UnsiN ^ coloring of (G, cr), once we have 
determined (c„)„g]N so that (i) and (ii) hold. Now let cq and Iq be determined 
by Co = Cq and lo{i) = 0 for all i G IN. Given n G IN and c„, In, so that conditions 
(i)-(iii) above are fulfilled, there exist b G B^{cn,ln)' If there exist (n,x,u) G V 
and d G B^{cn,ln) so that a{{n,x,u),d) = e then take b = d. Otherwise there 
exist such b G B^{cn,ln) according to Lemma O In both cases we can find 
suitable In+i and c„+i G A'^'{ln+i) by the construction given in Lemma 0 (in 
the first case we additionally have to fix ln+i{n) = oo!). 

□ 



2.4 Combining Colorings 

Until now, we have not mentioned how to restrict a so that (weakly) critical 
mappings exist. To prove the existence of such mappings in Section 3, we have 
to tackle a variation of the following problem: Given two colorings c', c" of 
(G, cr) and two subsets V , V" of V. Does there exist a coloring c of (G,a) so 
that cjv' = c'\v and c|v" = c"|y"? In this section we will give a sufficient 
condition for the restricted case c, c',c" G B^{b,l) (for given n, b, I and G). 

First we will introduce a restricted kind of ’transitive closure’ in G. Let 
G :C U — >■ 2^ be given so that conditions (Gl) - (G3) of Section BuH are fulfilled. 
Furthermore let V' be a subset of V and n G IN be given. By we denote 

the smallest subset V” of V so that 

(Tl) V' C V" and 

(T2) C(i, X, u) C V” for all {i, x, u) G V" \ dom{cQ) with i < n. 

We can bound the number of vertices in TG‘^{V) for finite V according to 

tJT„^’<=({(f, X, u)}) < (2.2) 

We state the following lemma only because of technical reasons. In the sequel 
we will need only Gorollary0 
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Lemma 6. Let i € TN, C be a weakly eritieal mapping of all j < i in (G, tr), 
I € (IN U {oo})''^ and b € A^(l) be given. Furthermore let V' ^V" C V so that 
there exist b',b" 3 b with b',b" G dom{b') 3 V' , dom{b") D V" and 

b'\j,c,G f^y,.^^rpC,G = b" \ rpG , G f^y , .^^rpG ,G ^y „ y Then at least one of the following 
statements holds 

(i) there exist j < i, (j,x,u) € V and c G B^{b,l) so that l{j) G IN and 

= e or 

(ii) there exist c G Bf{b,l) so that c\v' = b'\v and cjv" = b"\v". 

Proof. Let G, I, b, 5', b", V , V" and i be given as above. W.l.o.g. we will assume 
that dom{b) C V . Furthermore let us assume that statement (i) is false, i.e. 
cr((j, a;, u), c) ^ e for all j < i with l{j) G IN, all {j,x,u) G V and all c G 
Bj^{b,l). Then we have a situation similar to the proof of Lemma din Section 
tZ.'Jt a{{j,y,v),c') = a{{j,y,v),c'') for all j < i with l{j) G IN and all c',c" G 
B^{b,l) with c'\c(j,y,v) = c!'\c(i,y,v)- Starting with cq = 6 we determine c„+i 
by dom{cn+i) = {(j, y,u) G R | |?/| < n + 1} U dom{cn). Furthermore given 
€ dom{cn+i) \dom{cn) let c„+i(j, y, u) = b'{j,y,v) (= b”{j,y,v)) if 

(J,y,v) G {G T^^^{V")). If (j,y,u) i then 

let c„+i(j, y,u) = 1 iff there exist d G B^{cn,l) so that a{{j,y,v),d) = 1 and 
Cn+i{j,y,v) = 0 otherwise. Let c = Uigin'^*- ''iv ~ ~ b”\v" 

by definition, we have only to prove c G B^{cn,l) for all n S IN and j < i. 
This can be verified by arguments which are almost identical to the arguments 
of Lemma n Beside the cases we have distinguished there, we have here also to 
consider a third case (j, y, v) G U 

□ 



Notice that Lemma 0 is indeed a special case of Lemma 0 where V = V" = 0. 
Furthermore we get the 

Corollary 1. Let i, C , I and b be given as in Lemma 0 so that additionally 
l{j) = oo for all {j, x,u) G V and all c G B'f {b, 1) with j < i and a{{j, x, u),c) = 
e. Furthermore let to G IN, V ,V" C V and c' ,c!' G B^{b,l) be given so that 
c \rpG.G y^y^rpG.G ^y/y = c \rj.G,G y^y^rpG.G and |y| < TO fov all G R U 

v'. 

Then there exist c G Bf{b,l) so that c\v' = c'\v' and c|y" = c"\v" . 

3 A Relativized Collapse 

In this section we will use the results of the last sections to prove a relativized 
collapse of the polynomial time hierarchy of Monte-Carlo truth-table (oracle) 
machines. Before we state this result formally, we will give some basic notations 
concerning machines and furthermore we will state a connection between oracles 
and colorings. 

We use the standard model of probabilistic multitape Turing machines with a 
finite number of tapes (see [2]). Given such a probabilistic machine M we denote 
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the time complexity and language of M by Im and Lm respectively. Furthermore 
we will assume that outputs of M on its computation paths belong to {0, 1, ?}. 
Let 7T^(x), ^(a;) denote both the different computation paths 

of M on input x € E* and their outputs. Given a predicate Q on computation 
paths let Pm,x[Q] denote the set Pm,x[Q] = {0 < p < mM{x)\Q{TT^{x))}. For 
example we have x G Lm ‘^Pm,x[t^\i{x) = 1] > ^mM{x). 

We say that M respects the Monte-Carlo property on x {M G MCM{x) for 
short) iff either '^PM,x[^h{^) = 1] > |wM(a:) or 'iPM,x[^\f{x) = 0] > 1mM{x). 
M is called a Monte-Carlo machine (M G MCM) iff M G MCM{x) for all 
X G E*. A probabilistic machine is called Las- Vegas machine, iff for all x either 
t)^’M,x[7r^(a;) = 1] > ^mM{x) and ^PM,x[TrM(x) = 0] = 0 or 'iPM,x['^M(x) = 
0] > ^mM{x) and IPm,x[t^m{x) = 1] = 0. 

Probabilistic Turing machines can be relativized in a canonical way. Here we 
will assume that an oracle may also answer ?, that is an oracle is a mapping 
A \ E* ^ {0, 1, ?} rather than a subset A Q E* . 

Given an oracle A we adopt the above definitions and denote the relativized 
versions by rnM{x,A), tt^{x,A), MCM{x,A) and MCM{A). Fur- 

thermore we also adopt Pm,x [Q] and denote its relativized version by Pm x [Q] ■ 
Given a computation path tt of an oracle machine let g(7r) denote the set of 
oracle questions asked during the computation of tt. We call a probabilistic or- 
acle machine M a truth-table machine iff for all x G E* and all oracles A, B : 
E* — >• {0, 1, ?} we have mM{x, A) = mM{x, B) and q{Tr^{x, H)) = q{Tr^{x, B)) 
for all 0 < p < mM{x, A). In the case of truth-table machines we will omit 
the oracle A iff it is clear. We denote the time complexity classes determined by 
Monte-Garlo and Las- Vegas truth-table machines with oracle A by BPTIME^O 
and ZPTIME^O respectively. Monte-Garlo polynomial time is also denoted by 
BPPti- 

Let (•,•,•) : IN X V* X V* — >■ V* be determined by (i,x,u) = for all 

* G IN and x,u G E* . Furthermore let V be determined as in Section l?Tl a,nd Aq 
be a mapping Aq :C E* — >■ {0,1,?} so that dom{Ao) = {{i,x,u) \ (i,x,u) G V}. 
Once we have fixed such an Aq (in the sequel we will assume so) we have a one 
to one correspondence between cG {0,1,?}'^ and oracles A : E* — >• {0,1,?} with 
A D Aq: given c G {0, l,?}'^ let A,. D Aq be determined by A{{i,x,u)) = c{i,x,u) 
for all (i, X, u) G V. Given a computation path tt we will denote by q(7r) the set 
{(i,x,u) G Vl(i,x,u) G q(7r)} and by g(M,x) the set Uo<p<mM(0 
given truth-table machine M and input x). 

Now we are prepared to prove the main lemma. 

Lemma 7. There exist e > 0 and A : E* ^ {0,1,?} so that 
BPTIMEft{n^+‘^) = ZPTIMEftin). 

Proof. Let 0 < e < 1 and an enumeration Mq, Mi, ... of (clocked) probabilistic 
truth-table machines be given so that t^.{x) < \x\^ for all i G IN, a: G E* 
and all oracles A : E* — >■ {0,1,?}. Furthermore we will assume that for all 
A : E* ^ {0, 1, ?} and all L G BPTIME^(n^~^^) there exist ul G IN such that 
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£ MCM{A) and x £ L ^ x £ for almost all x £ S*. The existence 

of such an enumeration can be easily verified by standard techniques. 

In the sequel we will determine (G, cr), G = (V, E) so that 

E{j,y,v) D {{i,x,u)\{i,x,u) £ q{Mj,x)} (3.3) 

r 1 : M, £ MCM{y, A,) Ay £ L^i, 

^((j,2/,^'),c)= 0 : M,£MCM{y,A,)AyiLii. (3.4) 

I e : otherwise 

for all (j, y,v) £V and c £ {0, 1, ?}'^. Notice that a is uniquely determined by E 
and (E3) above. Thus we will give only a construction of G = (V, if) and denote 
the corresponding a according to (13.411 by erg. 

The following claim can be easily verified. 



Claim. Let (G, a) be given so that (t3.3ll and (1.3.41 are fulfilled and there exists 
a coloring c of (G, cr). Then BPTIMEf^'={rA+^) = Z PT I M Ef^<^ (n) . 

Thus we have to construct (G, cr) so that weakly critical mappings exist. 
Next we will determine a sequence Eq Q Ei Q ... and furthermore a sequence 
Go C Gi C ..., Ci :C V ^ 2^ for all f G IN, so that G = UneiN ^ weakly 
critical mapping in (G,ag), G = (V,E) where E = Furthermore we 

will determine G„ so that dom{Cn) = {{i,x,u) £ V\i < n}. 

Let Eo{j,y,v) = q{Mj,y) for all (j,y,v) G V and Go be determined by 
dom{Co) = 0. Notice that we have |x| < |?/p for all {i,x,u),{j,y,v) G V with 
(f, X, u) £ Eq{j, y, v). Given n G IN, En and G„ we will first determine Cn+i - Let 
{n,x,u) G R be given and M = M„. Furthermore let V{p) = T^”-’^”-{q{n^{x)) 
for 0 < p < rriM (x) . Then determine G„+i (n,x,u) by G„+i {n,x,u) = { (j, p, w) G 
V I HEM,xl(j,y,v) £ V{p)\ > (4 • maxo<p<™j^(3,) t)I^(p))"^ • mM{x)}. Now let 

P E„{j,y,v) : j^n 

n+i[J,y, yEn{j,y,v)UCn+i{j,y,v) : otherwise 

for all {j, y, v) £ V. 

To complete this proof we have to verify that G = (JneiN is a weakly 
critical mapping in (G, erg), G = (R, E) where E = (JneiN En- 

Claim. Let G, trg and G be given as above. Then G fulfills conditions (G1)-(G4) 
of Section EH and G is a weakly critical mapping in (G,crg). Thus there exist 
colorings c of (G,crg). 

Proof. It can be easily verified that (G1)-(G4) and (G1),(G2),(G3) hold for 
{G,ac) and G respectively. To prove (G4) let f G IN, Z G (IM U {oo})'^ and 
h G ^*"(0 be given so that for all {j,y,v) £ V with j < i and l{j) yf oo 
and for all c G Bf{b,l) we have crg((j, p, u), c) yf e. Furthermore let us assume 
that there exist (i,x,u) £ V and c',c" G B^{b,l) so that ac{{i,x,u),c') = 0, 
ac((i,x,u),c'') = 1 and c'\c{i.x,u) = c''\c{i,x,u)- We will now prove the existence 
of c G Bf (&, 1) so that cr((i, x,u),c) = e and thus verify (G4). 

Given p with 0 < p < mMi (x) let V (p) be determined by 
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y{p) = 

Furthermore given d G {0,1,?}'^ let denote the set P£ = PMi,x[d\v{p) = 
c'\v(p)\ and P^ = {0, mMi{x) — l}\Pl. Let ||d, c'|| be determined by ||d, c'|| = 
Now let d G Bf{b,l) be given so that d\c(i^x,u) = d\c{i,x,u), a((i,x,u),d) = 
1 and ||d,c'|| is minimal among these mappings. Because of a{{i,x,u),d) = 1, 
a((i,x,u),c') = 0 and by Corollary ^ there exist p G P^ and c G 
so that c\v = d\v and c\v" = d\v" where V = V{p) U {}{V{p)\p G Pt} 
and V" = {_}{V{p)\p G P'^ /\V{p) C\ V{p) C C{i,x,u)}. Obviously we have 
||c,c'|| < ||d,c'|| and c\c(i,x,u) = c'\c{i,x,u)- Furthermore we have 

'i.PMiA<^\v(p) ^ c|v(p)}] < 4p^tt^(p) • vhmAx) = i • mM,(a;) 

and thus iPMi,x[^MiA ’ = 1] ^ ' xanAx) - j ■ mnAx) > | • mu Ax) 

so that a((i,x,u),c) = e by minimality of ||(i, c'||. 

□ 



4 Conclusions and Open Problems 

In this section we will first give a few conclusions of Lemma Q for further dis- 
cussion see [5]. By standard padding arguments we get the main theorem: 

Theorem 1. There exist oracles A such that 

BPPA = ZPTIMEAAn). 

Notice that our proof of Lemma 0 and thus of A is not constructive because 
the construction of Cn,+i in Lemma 0 is not. To solve this problem we will first 
assume In+i A (see prove of Lemma EJ until we recognize that our guess is 
wrong. A carefull analysis of the construction shows that for all (i,x,u) G we 
can find a step so that A need not be altered later on. Thus the oracle A can be 
chosen to be recursively. 

Furthermore by a carefull analysis of equation 112. 1 II (Section r2.2ll and the 
proof of Lemma 7 we can weaken the time bound of the machines (M^iGiN 
in the proof of Lemma 0 so that tMi G 0{f) for all i and a function / with 
\/k,i G G 0(2’^). Together we can extend Theorem 1 to the following 

theorem. 

Theorem 2. Let f be a function such that'ikA G IN./^^^ G 0(2^). Then there 
exists a decidable oracle B so that 

BPTIMEfAf) = ZPTIMEAAn). 

Obviously Theorem 1 is not true for all oracles: even under random oracles 
BPP^ A BPTIME^fn). The same is true under any oracle C with P’^ = 
NP^. 

For general (Monte-Carlo) oracle machines and a few other semantic models 
it is also possible to define appropriate graph coloring problems. But in these 
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cases graphs and (weakly) critical mappings depend on colorings and vice versa. 
An interesting open problem is to find simpler proofs of such relativized collapses. 

The central problem remains to prove or disprove the unrelativized existence 
of polynomial time hierarchies for Monte-Carlo machines (and several other so 
called semantic machine classes). However, the results given in this paper and in 
[10] indicate that the separation of the time hierarchy classes will be a non-trivial 
task. Notice that even strong relations to other problems are not known. 



Acknowledgments. The authors would like to thank Marek Karpinski for 
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Abstract. We show that some natural refinements of the Straubing 
and Brzozowski hierarchies correspond (via the so called leaf-languages) 
step by step to similar refinements of the polynomial-time hierarchy. 
This extends a result of H.-J. Burtschik and H. Vollmer on relationship 
between the Straubing and the polynomial hierarchies. In particular, this 
applies to the boolean hierarchy and the plus-hierarchy. 



1 Introduction 

In complexity theory, the so called leaf-language approach to defining complexity 
classes became recently rather popular. Let us recall some relevant definitions. 
Consider a polynomial-time nondeterministic Turing machine M working on an 
input word x over some alphabet X and printing a letter from another alphabet 
A after finishing any computation path. These values are the leaves of the binary 
tree defined by the nondeterministic choices of M on input x. An ordering of the 
tuples in the program of M determines a left-to-right ordering of all the leaves. In 
this way, M may be considered as a deterministic transducer computing a total 
function M : X* — >■ A* from the set of words X* over X to the set of words over 
A. Now, relate to any language L C A* (called in this situation a leaf language) 
the language M~^{L) C X* . Denote by Leaf{L) the set of languages 
for all machines M specified above. For a set of languages C, let Leaf{C) be the 
union of Leaf{L), for all L G C. 

It turns out that many inportant complexity classes have natural and useful 
descriptions in terms of leaf languages (see e.g. [BCS92, V93, HL-l-93). In par- 
ticular, a close relationship between some classes of regular leaf languages and 
complexity classes within PSP ACE was established in [HL-l-93]. In [BV98], a 
close relationship between the Straubing hierarchy {iS„} and the polynomial 
hiearachy {Xp} was established: Leaf{Sn) = for any n > 0. 

In this paper, we consider the possibility of extending the last result to some 
refinements of the above-mentioned hierarchies (in the context of complexity the- 
ory, these refinements were introduced and studied in [Se93, Se94, Se95, Se99j). 

* Supported by the Alexander von Humboldt Foundation, by the German Research 
Foundation (DFG) and by the Russian Foundation for Basic Research Grant 00-01- 
00810. 
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Our interest in results of this kind is motivated by the hope that such relation- 
ships between automata-theoretic and complexity-theoretic notions may lead 
to nontrivial applications of one theory to the other. For an example of this see 
Section 7 below. 

Note that for the important particular case of the boolean hiearachy over 
NP a result similar to ours was earlier established in [SW98, Theorem 6.3], and 
we actually use the idea of proof of that theorem. We make also an essential use 
of a result from [PW97] cited in Section 3. 

In Section 2 we give the exact definitions of our hierarchies, in Section 3 we 
consider some relevant notions from language theory, in Sections 4 — 6 we present 
our main results, and further we give some examples and discussions. 



2 Hierarchies 

In different areas of mathematics, people consider a lot of hierarchies which are 
typically used to classify some objects according to their complexity. Here we 
define and discuss some hierarchies relevant to the topic of this paper. 

We already mentioned the polynomial hierarchy which is one of the 

most popular objects of complexity theory. Note that classes (or levels) of the 
polynomial hierarchy are classes of languages over some finite alphabet X. In the 
context of complexity theory, the cardinality of X is not important (provided 
that it is at least 2), so it is often assumed that X is just the binary alpha- 
bet {0,1}. For detailed definition and properties of the polynomial hierarchy 
and other relevant notions see any standard textbook on complexity theory, say 
[BDG88, BDG90]. Sometimes it is convenient to use more compact notation for 
the polynomial hierarchy, namely PH\ hence PH = {Sp}. 

Let us define now two hierarchies which are rather popular in automata 
theory. A word u = uq ■ ■ - Un G A~^ denotes the set of finite nonempty words 
over an alphabet A, while A* — the set of all finite words over A, including the 
empty word e) may be considered as a first-order structure u = ({0, . . . , n}; < 
) Qa) • ■ Oj where < has its usual meaning and Qa{a G A) are unary predicates on 
{0, . . . , n| defined by Qa{i) ^ Ui = a. By Theorem of McNaughton and Papert 
[MP71], the so called star-free regular languages are exactly the sets of words 
u, for which u satisfies a first-order sentence of signature aA = {<, Qa, }a&A- 

For any n > 0, let denote the class of languages defined by A^-sentences 
of signature a (i.e., sentences in prenex normal form starting with the existential 
quantifier and having n — 1 quantifier alternations); the sequence SH = {5„} 
is known as the Straubing (or Straubing-Therien) hierarchy. In cases when it is 
important to specify the alphabet, we denote the n-th level as A+£„, and the 
whole hierarchy as A~^SH. There is also a *-version of the Straubing hierarchy 
which will be denoted as A*SH = {A*5„}; the relationship between both ver- 
sions is very easy: A*Sn = A+5„ U A+5^ (for any class X of subsets of A+ we 
denote X^ = {XVJ {e}\X G X}). 

The Brzozowski hierarchy is defined in the same way, only in place of a a one 
takes the signature cr(^ = cr^ U {T, T, s|, where T and T are constant symbols 
and s is a unary function symbol (_L, T are assumed to denote the least and the 
greatest element respectively, while s denotes the successor function) . Brzozowski 
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hierarchy will be denoted by BH = {Bn}, with the corresponding variations in 
case when we need to mention the alphabet explicitely. 

Note that in automata theory people usually define the Straubing and Br- 
zozowski hierarchies by means of regular expressions; the equivalence of those 
definitions to definitions used here is known from [T82, PP86]. For more infor- 
mation on logical aspects of automata-theoretic hierarchies see also [SeOl]. 

Next we would like to define some refinements of the introduced hierarchies. 
In order to do this in a uniform way, we need a technical notion of a base. Let 
(i?;U,n,“0, 1) be a boolean algebra (b.a.). Without loss of generality, one may 
think that B is a, class of subsets of some set. By a base in B we mean any 
sequence L = {L„}n<u, of sublattices of (i?;U,n,0, 1), satisfying the inclusions 
L„Uco(L„) C Ln+i (here co{Ln) denotes the dual set [x\x G L„} for L„). Note 
that levels of the hierarchies introduced above (as well as of many other popular 
hierarchies) are bases (take in place of L„ respectively N'^+i, 5„+i and Bn+i)- 
With any base L = {Ln}n<u; one can associate a sequence of new subsets 
of B as follows. Let T be the set of terms of the signature (U,n,“, 0,1) with 
variables v}}{k,n < to). We call < to) variables of type n, and elements of 
T — typed boolean terms. Relate to any term t G T the set t{L) of all its values 
when variables of type n range over L„. We call the sequence {t{L)}t the typed 
boolean hierarchy over L. 

Let us state some easy properties of the introduced classes. 

Lemma 1. 

(i) Typed boolean hierarchy is a refinement of L, i.e. any class Ln is among the 
classes t{L),t G T. 

(ii) If L is a base in B, L' is a base in B' and f : B ^ B' is a homomorphism of 
boolean algebras such that f{Ln) C L'^ for all n < uj, then f{t{L)) C t{L') 
for any t G T. 

Proof. 

(i) Let t = Vq, then t(L) = Ln- 

(ii) We have to deduce f(a) G t{L') from a G t{L). Let t = t{xo, ■ ■ ■ ,Xk), where 
Xo, . . ■ ,Xk are some typed variables. By definition of t{L), a = t{ao, . . . , Ofc) 
for some oq, . . . ,ak G B such that aj G L„ whenever xj is of type n. By a 
property of homomorphisms, /(a) = t(/(oo), . . . , /(ofc)). From /(L„) C 

it follows that f{aj) G L'^ whenever Xj is of type n. Hence, /(a) G t{L'). 
This completes the proof. 

Now we prove a more subtle property useful for some considerations below. 
Let L be a base in a b.a. B, and let B' = B x {0, 1} be the cartesian product 
of B and of the 2-element b.a. {0, 1}. Hence, B' = {{b,c)\b G H,c < 1} and the 
boolean operations in B' are componentwise. We will state a close relationship 
between the typed boolean hierarchies over L and over the base L' in B' defined 
by L'n = LnX {0, 1}. 

Let B‘^ = B X {c}, then B' = B^ B^ and may be considered as a b.a. 
isomorpic to B (isomorphism is given of course by the function b i— >■ (6, c); note 
that symbols fl, U have the same interpretations in as they have in B' , while 
the symbol “has slightly different interpretations in B^ and B^). Let be the 
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base in B‘^ induced by L under this isomorphism, i.e. = L„ x {c}. Then 

We call a boolean term t{xo, . . . ,Xk) nontrivial, if the function induced by 
this term on {0,1} is not constant, i.e. for any c < 1 there are Cj < 1 satisfying 
t(co, . . . , Ck) = c. As is well known, a term is nontrivial iff it is not equivalent to 
the constant boolean terms 0, 1 in the theory of boolean algebras. 

Lemma 2. 

(i) For any t gT and c < 1, = t{L) x (cj. 

(a) For any nontrivial t G T, t{L') = t{L^) U t{L^). 

Proof. 

(i) follows from Lemma 1. 

(ii) Let t = t{xo, . . . ,Xk) for some typed variables Xj, and let a G t{L'),a = 

{b,c) G B'. Then a = t{{bo,co), . . . ,{bk,Ck)) for some (bj,Cj) G B' 

such that (bj,Cj) G Ljj whenever Xj is of type n. We have a = 

(t(6o, . . . , 6fe), t(co, . . . , Cfe)), and bj G whenever xj is of type n. Hence, 
b G t{L) and a G t{L) x |c} = t{L‘^). 

It remains to check the inclusion t{L) x {cj C t{L'). By nontriviality of t, 
c = t{co, . . . , Cfc) for some Cj < 1. Let a = {b, c) G t{L) x (cj, then b = t{bo, . . . , bk) 

for some bo, ... ,bk G B such that bj G L„ whenever Xj is of type n. We have 

a = t{{bo,co), . . . ,{bk,Ck)) and (bj,Cj) G Ljj whenever xj is of type n. Hence, 
a G t{L') completing the proof. 

Taking in place of L the base PFI = we get the typed Boolean 

hierarchy {t{PF{)} over PFI introduced and studied in [Se94, Se94a, Se95, Se99]. 
In particular, the following fact was established. 

Lemma 3. All classes of the typed boolean hierarchy over PH are closed down- 
wards under the polynomial m-reducibility and contain polynomially m-complete 
sets. 

Taking in place of L the base SH (the base BH), we get the typed boolean 
hierarchy {t{SH)} (resp., {t{BH)}) over SH (resp., over BH). A study of these 
hierarchies, as well as of fine hierarchies over SH and BH defined below was 
initiated in [SSOO]. If we want to specify the alphabet explicitly we again use 
notation like t{A^SH). 

The three examples of the typed Boolean hierarchy introduced above will be 
the main objects of this paper. Let us state some easy facts on the typed boolean 
hierarchies over SH and BH . The foolowing assertion follows from Lemma 1. 

Lemma 4. 

(i) For any n > 0 and any alphabet A, A+5„ C A*Sn and A+5„ C A'^Bn- 

(ii) For any t G T and any alphabet A, t{A~^SH) C t{A*SH) and t{A^SH) C 
t{A+BH). 

Now we establish a relationship between the *- and +-versions of the typed 
boolean hierarchy over SH which informally means the equivalence of the both 
versions for the purposes of this paper. 
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Lemma 5. 

(i) For any nontrivial t G T and any alphabet A, t{A*SH) = t{A~^SH) U 
t{A+SHy. 

(a) For any t gT and any alphabet A, Leaf{t{A*SF[)) = Leaf{t{A~^SF[)). 

Proof. 

(i) Let B = P(A+) and L = A^SF[. Then b.a. P{A*) is naturally isomorphic 
to the b.a. B' = B x {0, 1} (the isomorphism h : i? x {0, 1} — >■ P{A*) is 
defined by h{L,0) = L and h{L, 1) = L U {e}). Using notation introduced 
before Lemma 2, we have h{L'^) = A*Sn- Now the assertion follows from 
Lemma 2. 

(ii) If t is equivalent (in the theory of boolean algebras) to 0, then t{A*SF{) = 
{0} = t{A~^SP[), and the equation follows. If t is equivalent to 1, then 
t{A*SH) = {A*} and t{A+SH) = {A+}. But Leaf (A*) = X* = Leaf{A+), 
because the leaf word is always nonempty. Finally, let t be nontrivial. Then 
the equation follows from (i), because Leaf{t{A^SF[y) = Leaf{t{A~^SP[)), 
again by the nonemptyness of the leaf word. This completes the proof of the 
lemma. 

We will consider also another hierarchy called the fine hierarchy over L. It was 
introduced by the author in the context of recursion theory and then considered 
also in several other contexts. Let us briefly recall the definition of the fine 
hierarchy over L. Its classes (or levels) Sa are numbered by ordinals a < eg, 
where Sq = sup{io , uj‘^ , , ■ • •} (for more information about the well-known 

ordinal Eq and the ordinal arithmetic see e.g. [KM67]). 

We define the classes S'”, where n is an auxiliary parameter, by induction on 
a as follows (simplifying notation we write in this definition ab in place of an&): 
So” = 0 ; S”^ = S”+i for 7 > 0 ; 

S^+i = {uqXq U UiXi\ui G L„,Xo G S”,a;i G co(S”), UoWia^o = uqUiXi}] 
Ss+u.'i = {ut)Xo UiXi \j UoUiy\ui G Ln,Xo G S”,Xi G co{S^),y G 
Sg, uqUiXo = uoMiXi} for 5 = • d' > 0, 7 > 0 . 

To see that this definition is correct note that every nonzero ordinal a < Eq 
is uniquely representable in the form a = ui'^° + • • • + w'*''” for a finite sequence 
7o > ■ ■ ■ > 7fc of ordinals < a. Applying the definition we subsequently get 
SZ 10 , SZ~,o+u>~n Finally, let = S°. 

For information on the properties of the fine hierarchy see e.g. [Se95]. 

3 Families of Languages 

By a +-class of languages [PW97] we mean a correspondence C which associates 
with each finite alphabeth A a set A+C C P(A+), where P(A+) is the set of 
all subsets of A+ . In this paper we need classes of languages with some minimal 
closure properties as specified in the following 

Definition. By a +-family of languages we mean a +-class C = {A'^C}a 
such that 

(1) for every semigroup morphism (p : A+ — >■ P+, L G B^C implies G 

A+C; 
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(2) if L G A~^C and a G A, then a = {v G A~^\av G L} and La ^ G 
A~^\va G L} are in A'^C. 

This notion is obtained from the notion of a positive +-variety introduced 
in [PW97] by omitting the condition that any A'^C is closed under finite union 
and intersection. The notion of a *-family of languages is obtained from the 
above definition by using * in place of + and monoid morphism in place of the 
semigroup morphism (as again in [PW97] for the notion of a positive *-variety). 
The following evident fact will be of some use in the next section. 

Lemma 6. Let C he a *-family of languages and A, B be any alphabets of the 
same cardinality. Then Leaf{A*C) = Leaf{B*C). 

Proof. By symmetry, it suffices to check the inclusion in one direction, say 
Leaf{A*C) C Leaf{B*C). Let K G Leaf{A*C), then K = M~^{L) for an 
L G A*C and a suitable machine M. Let (f> : A ^ B he a one-one correspondence 
between A and B, and (j>i : B* -G A* be the monoid morphism induced by 
Then Li = (f>f^{L) G B*C and K = Mff^{Li), where M\ is a machine behaving 
just as M with the only difference that it prints <f{a) whenever M prints a. 
Hence, K G Leaf{B*C) completing the proof. 

From results in [PW97] we easily deduce the following facts about classes of 
hierarchies introduced in Section 2. 

Lemma 7. 

(i) For any n > 0, o,nd {A~^Bn}A are positive +-varieties, while 

is a positive *-variety. 

(ii) For any typed Boolean term t, {t{A'^SH)}A and {t{A~^ BH )}a are +- 
families of languages while {t{A* SL [)}a is a *-family of languages. 

(Hi) For any a < e, {Sa{A~^ SH )}a and {Sa{A'^BH)}A are +-families of lan- 
guages while {Sa{A~^ SF [)}a is a *-family of languages. 

Proof. 

(i) s proved in [PW97] and plays a principal role for our paper. 

(ii) Let <j) : A~^ -A B^ be a semigroup morphism and let L G t{B^S). By (i), 
the preimage map 4>~^ satisfies conditions of Lemma 2.(ii). Hence, 4>~^{L) G 
t(A'^S). Property (2) from definition of the family of languages, as well as 
the remaining assertions from (ii) and the assertion (iii), are checked in the 
same way. 

4 Typed Boolean Hierarchy over SII 

In this section we relate some hierarchies introduced in Section 2 via the leaf 
language approach. First we consider languages from classes of the typed boolean 
hierarchy over S'iJ as leaf languages. 

Theorem 1. For any typed boolean term t, L)ALeaf(t{A*SH)) = t{PF[) = 
UALeaf{t{A^SH)). 
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Proof. By Lemma 5, it suffices to prove the equality UALeaf{t{A*SH)) = 
t{PH). First let us note that the result from [BV98] cited in the Introduction is 
exactly formulated as the equality UALeaf{A*Sn) = for any n > 0. 

Now let us check the inclusion UALeaf{t{A*SH)) C t{PH). Let K G 
Leaf{t{A*SH)), then K = M~^{L) for some polynomially bounded nondeter- 
ministic Turing machine M and some L G t(A*SH). The map M~^ : P(A*) — >■ 
P{X*) is a homomorphism of boolean algebas satisfying (by the Theorem of 
Burtschick and Vollmer) the inclusions M~^{A*Sn) C IJp. By Lemma l.(ii), 
K = M~^{L) G t{PH), as desired. 

For the converse inclusion, choose any K in t{PH) and let t = t{xo, ■ ■ ■ , Xk), 
where Xj are typed variables. Then K = t{Ko, . . . , K^) for some Kq, . . . , C 
X* such that Kj G P^n+i whenever Xj is of type n. By the Theorem of Burtschik 
and Vollmer, there exist alphabets To,...,Afe and languages Lj C A* such 
that Kj G Leaf{Lj) and Lj G 5„+i whenever Xj is of type n. By Lemma 3.2, 
the alphabets Aq, ... ,Ak may be without loss of generality assumed pairwise 
disjoint. Let A = AqU- • -UAfe. Now it suffices to show that K G Leaf{t{A*SH)). 

Let Mq , . . . , Mk be nondeterministic polynomyal time Turing machines sat- 
isfying Kj = M~^{Lj). Consider the nondeterministic polynomial time Tur- 
ing machine M which behaves as follows: on input x G X* , it branches non- 
deterministically into k + 1 computation paths, and on the j-th (from left to 
right) path just mimicks completely the behavior of the machine Mj. Note that 
the leaf string M{x) will be the concatenation of the leaf strings Mj{x), i.e. 
M{x) = Mq{x) ■ ■ ■ Mk{x). 

For any j < k, let (j)j '. A* ^ A* be the morphism erasing all letters not in 
Aj. Then, by Lemma 3.3, (f>J^{Lj) G T„+i whenever Xj is of type n. Hence, the 
language P = . . . , is in t{A*SH). Hence, it suffices to check 

that K = M~^{P) = But <pj{M{x)) = 

Mj{x), hence M~^{(j)J^{Lj)) = M~^{Lj) and the desired equality follows im- 
mediately from the equality K = t{KQ , . . . , K^) = . . . , 

This concludes the proof of the theorem. 

In [BV98] the result was proved in a more exact form than it was formulated 
above. They proved also that for any n > 0 there is an alphabet A and a 
language L G such that Leaf{L) = S^. This is also generalizable to the 

typed boolean hierarchy. 

Theorem 2. For any t G T there exist an alphabet A and a language L G 
t{A~^SH) such that Leaf{L) = t{PH). 

Proof. By Lemma 3, there exists a language K C X* polynomially m-complete 
in t{PH). By Theorem 4.1, there exist an alphabet A and a language L G 
t{A~^SH) such that K G Leaf{L) C t{PH). It is well-known [BCS92] that the 
class Leaf{L) is closed downwards under the polynomial m-reducibility. Hence, 
t{PH) C Leaf{L) completing the proof. 

5 Typed Boolean Hieararchy over BH 

The next result is an evident analog of Theorem 1 for the Brzozowski hierarchy. 
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Theorem 3. For any t GT, t{PH) = UALeaf(t{A'^BF[)). 

The relationships between automata-theoretic hierarchies and the 
complexity-theoretic ones established in Theorems 1 and 3 look dependent on 
the alphabet. It seems that for the Straubing case the dependence is really essen- 
tial (though we have yet no formal proof of this) . Our next result shows that for 
the Brzozowski case one can get an alphabet-independent version of Theorem 3. 

Theorem 4. For any t G T and any alphabet A having at least two symbols, 
Leaf{t{A+BH)) = t{PH). 

The idea of proof is evident: to code symbols of a bigger alphabet by sequences 
of symbols of a smaller alphabet using the presence of the successor function in 
the signature from Section 2. In the next few lemmas we collect observations 
needed for realization of this idea. For technical convenience, we will assume in 
these lemmas that the alphabet A is a finite subset of w. 

Define a function / : w — >• {0, 1}"*" by /(n) = 01 ... 10, where the sequence 
of I’s is of length n -I- 1. With any alphabet A C w we associate a semigroup 
morphism f = f a '■ A~^ — >■ {0, 1}+ induced by the restriction of / to A. E.g., 
for A = {0, 1,2} and w = 0212 we get f{w) = 01001110011001110. In general, 
if w = ao ■■■ Ok for Oj G A then f(w) is the superposition f(ao) ■ ■ ■ f{ak)- For 
i < k, let i' denote the position of the first letter of f{aj) (this letter is of course 
0) in the word f{w). As usual, the length of a word v is denoted by |w|, and for 
f < |u| the f-th letter in v is denoted by Vi. The following assertion is evident. 

Lemma 8. 

(i) For all i,j < |w|, i < j f . 

(a) For any I < \ f{w)\, I G {i'\i < |w|| iff {f{w))i = 0 and {f{w))i+i = 1. 

Let a A = {<,Qa}aeA and a' = C{o,i} = {<, Qo, Qi, -L, T, sj be the signa- 
tures discussed in Section 2. Relate to any formula (j) of a a & formula (f)' of a' by 
the following induction: 

if (() is a; = 2/ or a; < y then (j)' is </>; 

if (j) is Qa{x) then (j)' is Qo{x) A Qi(s(a;)) A • • • A Qi(s“+^(a;)) A (5o(s“+^(a;)); 
if (() is </>i A 02 then 0' is 0} A 02; 
if 0 is -101 then 0' is “i0}; 

if 0 is Va;0i then 0' is 'ix{Qo{x) A (5i(s(a:)) — >• 0(). 

The other connectives V,— >-,3 are expressed through A,-i,V in the usual 
way. Below we discuss structures of the form w related to words w as specified 
in Section 2. When we want to stress the signature in which we consider the 
structure, we use notation like (w;cr^). For fo,...,ifc < (w; cr^, to, . . . , ifc) 

denotes the usual enrichment of the structure (w; cta) by constants ij. From 
Lemma 8 we easily get 

Lemma 9. 

(i) Let 0 = (j){xo, . . . ,Xk) be a formula of signature aA, w G A+ and 
io,...,ik < |i«|. Then we have (w; CTo, to, . ■ • , 4) h 4>(xo, . ■ . ,Xk) iff 
(f(w);cr',to, . . . ,t}) h (f'{xo,...,Xk). 
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(a) For any w € A+ and any sentence 4> of a a, we have (w; cta) \= iff 
(f(w); cr') 1= (f)'. 

Let F be the set of all sentences of signature a a - For f/) G let = {li; G 
A+|w \= <p} and let <j) = if iff = L^. Let B be the quotient of the structure 
(tP; A, V, -I, true, false) under the congruence relation =. As is well-known, B is 
a boolean algebra; abusing (and simplifying) notation, we denote elements of B 
as corresponding sentences. For n < w, let L„ be the subset of B corresponding 
to the set of Ifn+i-sentences; then L = {L„} is a base in B. When we want to 
stress the alphabet, we denote this base by La- 

The construction from preceding paragraph applies also to sentences of a'^ 
in place of cr^ (as well as to sentences of any other signature). We denote the 
corresponding base as L' = L'j^. The next lemma follows from Lemma 1. 

Lemma 10. Let t G T and A be any alphabet, AQui. 

(i) L G t{A^SH) iff L = L^ for some (p G t{LA), and similarly for A^BH and 
for the base L'^. 

(a) If 4> G t{LA) then (p' G t{L'^Q ^j). 

Now we are able to give a proof of the theorem. 

Proof of Theorem 4. The inclusion Leaf{t{A^BH)) C t{PH) was already 
noticed above. For the converse, it clearly suffices to check that t{PH) C 
Leaf{t{{0,l}~^BH)). By Theorem 1 and Lemma 6, it suffices to check the in- 
clusion Leaf{t{A^SH)) C Leaf{t{{0, 1}~^BH)), for any alphabet A C w. So 
assume that K G Leaf{t{A~^SH)). Then K = M~^{L) for a suitable machine 
M and for some L G t{A^SH). By Lemma 10, L = L,p for some <p G t{LA)- 
Again by Lemma 5.5, p' G t(L{Q i}) and L^/ G t({0, 1}^BH). Hence, it remains 
to check that M~^{L^) G Leaf{Lfj,i). In other words, it remains to find a suitable 
machine Mi satisfying M~^{L^) = M~^{L,p/). 

Let Ml on the input x behaves exactly as M on input x, with the only 
exception that at the end of any computation path, whenever M prints a G A, 
the machine Mi ’’prints” the word /(a) G {0,1}“'' (more exactly. Mi branches 
nondeterministically to |/(a)| paths and on these paths prints subsequently bits 
of the word f{a)). In this way we get Mi{x) = f{M{x)). By Lemma 9, M{x) |= p 
iff Mi{x) \= p'. In other words, x G M~^{Lf) iff x G Mf^{L^i), for any x G X* . 
Hence, M~^{L^) = M~^{L,pi) completing the proof of the theorem. 

As in Section 4, we automatically get 

Corollary 1. For any t GT there exists a language L G t{{{0,l}* BH)) satis- 
fying Leaf{L) = t{PH) . 



6 Fine Hierarchy 

In two proceeding sections we have described the situation with the typed 
boolean hierarchy rather comprehensively. Here we discuss similar questions 
for the fine hierarchy. The general question is to understand the relationships 
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between classes Leaf{Sa{A'^SH)), Leaf{Sa{A~^BH)) and Sa{PH), for any 
a < £o- We give some partial information relevant to this question. 

The proof of the next fact is strightforward. 

Proposition 1. For any a < Eq and any alphabet A, Leaf(Sa(A~^SH)) C 
Leaf(Sa(A+BR)) C Sa(PB). 

Unfortunately, till now we were unable to prove the remaining inclusion 
Sa{PH) C Leaf{UAA~^Sa{SH)). The problem is that the fine hierarchy is de- 
fined in terms of values of boolean terms, values of whose variables satisfy some 
constraints (see Section 2) . It is not clear how to preserve those constraints under 
transfer to another hierarchy. 

The next result, which is reminscent of Theorem 4 and which is proved in the 
same way, shows that for the Brzozowski case we again may ’’reduce” alphabets. 

Proposition 2. For any a < Eq and any alphabet A, Leaf{Sa{A'^SF[)) C 
Leaf{S^{{0,l}^BH)). 

7 Examples and Discussion 

The typed boolean hierarchy and the fine hierarchy are rather abstract and rich 
structures. In this section we formulate and discuss some interesting particular 
cases. 

Let again B = (i?; U, n,“ 0, 1) be a b.a. Define an operation of addition of 
classes X,Y C B hy the equal! 

ty X + Y = {xAy\x G X,y G Y}, where xAy is the symmetric difference of 
X and y. This operation is induced by the operation of addition modulo 2, hence 
it is associative and commutative and we may a fortiori freely use expressions 
like ^0 + ■ ■ ■ + Xn- 

Let L be a sublattice of (i?; U, fl, 0, 1) . For any k > 0,let Dk = L-\ \- L {k 

summonds in the righthand side). In [KSW86] it was shown that the sequence 
{Dk{L)} coincides with the well-known boolean (or difference) hierarchy over 
L. 

Taking now NP in place of L, one gets the boolean hiearachy over NP, a 
rather popular object in complexity theory introduced in [WW85]. More gener- 
ally, one could consider the boolean hierarchy {£>^(11^)} over the n-th level of 
the polynomial hierarchy. It is natural to ask: is there a natural description of 
these classes in terms of leaf languages? To answer the question, one have only to 
note that for any base L in B the boolean hierarchy over any class £„ is a frag- 
ment of the typed boolean hierarchy (as well as of the fine hierarchy), see [Se94]. 
E.g., we could consider the boolean hierarchy over any class Bn = {0, l}~^Bn of 
the Brzozowski hierarchy and immediately get 

Corollary 2. For all n,k > 0, Leaf{Dk{Bn)) = Dk{XP). 

For the case of the boolean hierarchy over NP and the boolean hierarchy 
over iSi the corresponding result was earlier obtained in [SW98]. 
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Another interesting example is the plus-hierarchy introduced implicitly in 
[Se94, Se95] and explicitely in [Se94a, Se99]. The levels of the plus-hierarchy over 
any base L are obtained when one applies the operation + introduced above to 
the levels for all n < u>. Any finite nonempty string a = (no, . . . , n^) of nat- 
ural numbers satisfying Uq > ■ ■ ■ > Uk defines the level Pa{L) = L^g H — • -I- 
of the plus-hierarchy over L. One easily checks that in this way we get actually 
all the levels of the plus-hierarchy, that the finite sequences specified above are 
ordered lexicographically with the order type and that U co(Pcr) C 
whenever a < t. 

The plus-hierarchy is again a fragment of the typed boolean hierarchy, so we 
get e.g. 

Corollary 3. For any sequence a as above, Leaf{Pc{{0,l}'^BH)) = P„{PH). 

What is the aim of proving results of this type? In our opinion, the existence 
of nontrivial connections between automata-theoretic and complexity-theoretic 
hierarchies is interesting in its own right and is somewhat unexpected. Maybe, 
some time results of this type may be even of use. E.g., assume for a moment that 
the Brzozowski hierarchy collapses. By the Theorem of Burtschik and Vollmer, 
the polynomial hierarchy would then collapse too. This is of course unlikely, 
hence the Brzozowski hierarchy should not collapse. And this is actually a proven 
fact of the automata theory [BK78]. From [Ka85] we know that the boolean 
hierarchy over any does not collapse, provided that PH does not collapse. 
Hence, the boolean hierarchy over any level of BH also should not collapse. And 
this was indeed proved in [Sh98,SS00], though the proofs are rather involved. 

Actually, we can even deduce results of this type from some known facts of 
complexity theory. As an example consider the following generalization of results 
from [Sh98, SSOO]. 

Corollary 4. The plus-hierarchy over {0, l}+PiJ does not collapse. 

Proof. Suppose the contrary. Then, by relativization of Corollary 3, the rela- 
tivization (with any oracle) of the plus-hierarchy over PH would collapse. By 
[Se94, Se94a, Se99], the relativization of PH (with any oracle) would collapse 
too, contradicting to a well-known result of complexity theory. 
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Abstract. Many documents such as Web documents or XML files have 
tree structures. A term tree is an unordered tree pattern consisting of 
internal variables and tree structures. In order to extract meaningful and 
hidden knowledge from such tree structured documents, we consider a 
minimal language (MINL) problem for term trees. The MINL problem 
for term trees is to find a term tree t such that the language generated by 
t is minimal among languages, generated by term trees, which contain all 
given tree structured data. Firstly, we show that the MINL problem for 
regular term trees is computable in polynomial time if the number of edge 
labels is infinite. Next, we show that the MINL problems with optimizing 
the size of an output term tree are NP-complete. Finally, in order to 
show that our polynomial time algorithm for the MINL problem can be 
applied to data mining from real-world Web documents, we show that 
regular term tree languages are polynomial time inductively inferable 
from positive data if the number of edge labels is infinite. 



1 Introduction 



Many documents such as Web documents or XML files have tree structures. In 
order to extract meaningful and hidden knowledge from such documents, we need 
tree structured patterns which can explain them. As tree structured patterns, 
a tree pattern PEE!, a type of objects 0 and a tree-expression pattern ini 
were proposed. In ISEH, we presented the concept of term trees as a graph 
pattern suited for representing unordered tree structured data. A term tree is an 
unordered tree pattern which consists of internal variables and tree structures. 
A term tree is different from the other representations proposed in \ 1 1114191 n\ in 
that a term tree has internal structured variables which can be substituted by 
arbitrary trees. A term tree t is said to be regular if the labels of all variables are 
different. In [Z], we proved that the matching problem for an extended regular 
term tree and a standard tree is NP-complete. However, in Eli we showed that the 
matching problem for a regular term tree whose variables consist of two vertices 
is computable in polynomial time. Then in this paper, we consider only such a 
regular term tree. Since a variable can be replaced by any tree, overgeneralized 
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Ti 




T2 




T3 



t 



t' 




Fig. 1. A term tree t and a term tree t' as an overgeneralized term tree and one of 
the least generalized term trees explaining Ti, T2 and T3, respectively. A variable is 
represented by a box with lines to its elements. The label of a box is the variable label 
of the variable. 



patterns explaining all given data are meaningless. Then the purpose of this 
work is to find one of the least generalized term trees explaining tree structured 
data. In Fig. ^ we give examples of term trees t and t' which can explain all 
trees T\, T 2 and T^. The term tree t is an overgeneralized term tree which is 
meaningless. But the term tree t' is one of the least generalized term trees. 

The concept represented by a term tree, which is called a term tree language, 
is the set of all trees obtained by replacing all variables by arbitrary trees. The 
minimal language (MINT) problem is the problem of finding a term tree whose 
term tree language is minimal among term tree languages containing all given 
trees. In isini, for a special type of regular term trees, which are called a regular 
term caterpillar, we showed that the MINT problem for regular term caterpillars 
is computable in polynomial time. In this paper, we show that the MINT prob- 
lem for regular term trees is computable in polynomial time if the number of 
edge labels is infinite. Moreover, we consider the following two problems. Firstly, 
MINT with Variable-size Minimization is the problem of finding a term tree t 
such that the term tree language of t is minimal and the number of variables in 
t is minimum. Secondly, MINT with Tree-size Maximization is the problem of 
finding a term tree t such that the term tree language of t is minimal and the 
number of vertices in t is maximum. Then we prove that the both two prob- 
lems are NP-complete. These results show the hardness of finding the optimum 
regular term tree representing all given data. 

In order to show that our polynomial time algorithm for the MINT problem 
can be applied to data mining from real-world Web documents, we show that 
regular term tree languages are polynomial time inductively inferable from pos- 
itive data if the number of edge labels is infinite. In |S|, we proposed a tag tree 
pattern which is a special type of a regular term tree and is suited for express- 
ing structures of XML documents, and presented an algorithm for generating all 
maximally frequent tag tree patterns. The results of this paper give a theoretical 
foundation of the result in 0. 

This paper is organized as follows. In Section El we give the notion of a term 
tree as a tree structured pattern. Also we formally define the MINT problem for 
regular term trees. In Section 0 we give a polynomial time algorithm solving the 
MINT problem for regular term trees if the number of edge labels is infinite. And 
we show the hardness of finding the optimal term tree which explaining all given 
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data in Section 0 We show that regular term tree languages are polynomial time 
inductively inferable from positive data in Section 0 

2 Preliminaries — Term Trees as Tree Structured Patterns 

Let T = (Vt, Et) be a rooted unordered tree (or simply a tree) with a set Vt of 
vertices, a set Et of edges, and an edge labeling. A variable in T is a list [u, u'] of 
two distinct vertices u and u' in Uj’. A label of a variable is called a variable label. 
A and X denote a set of edge labels and a set of variable labels, respectively, 
where AD X = 4>. For a set S, the number of elements in S is denoted by |S'|. 

Definition 1. A triplet g = {Vg, Eg, Hg) is called a rooted term tree (or simply a 
term tree) if Elg is a finite set of variables such that for any [u, u'] € Hg, [u' , u] is 
not in Hg, and the graph {Vg, EgVJE'g) is a tree where E'g = {{it, u} | [it, i>] G Hg}. 
A term tree g is called regular if all variables in Hg have mutually distinct 
variable labels in X. In particular, a term tree with no variable is called a ground 
term tree and considered to be a standard tree. TZTT and QTT denote the set of 
all regular term trees and the set of all ground term trees, respectively. 

For a term tree / and its vertices ui and Vi, a path from vi to Vi is a sequence 
vi,V 2 , ■ . ■ ,Vi of distinct vertices of / such that for 1 < j < i, there exists an edge 
or a variable which consists of Vj and u^+i. If there is an edge or a variable which 
consists of V and v' such that v lies on the path from the root of / to i>', then v 
is said to be the parent of v' and v' is a child of v. Without loss of generality, 
we assume that v' is a child of v if [u, v'] is a variable in /. 

Let / = {Vf,Ef,Hf) and g = {Vg, Eg,Hg) be regular term trees. We say that 
/ and g are isomorphic, denoted hy f = g, if there is a bijection cp from Vf to 
Vg such that (i) the root of / is mapped to the root of g by p, (ii) {u, u} G Ef if 
and only if {g}{u) , ip{v)} G Eg and the two edges have the same edge label, and 
(iii) [u,v] G Hf if and only if [ip{u) , g){v)] G Hg. Two isomorphic regular term 
trees are considered to be identical. 

Let / and g be term trees with at least two vertices. Let a = [u, u'\ be a list 
consisting of the root u ol g and another vertex u' in g. The form x := [g, cr] is 
called a binding for x. A new term tree f{x := [g, cr]} is obtained by applying the 
binding x := [g, a] to / in the following way: Let ei = [ui, u{], . . . , = [um, 

be the variables in / with the variable label x. Let g\, . . . ,gm be m copies of 
g and Ui,u{ the vertices of gi corresponding to u,u' of g, respectively. For each 
variable e, = [vi,v'j\, we attach gi to / by removing the variable from Hf and 
by identifying the vertices vt , v[ with the vertices Ui , u'i of gi . We define the root 
of the resulting term tree as the root of /. A substitution 0 is a finite collection 
of bindings {a;i := [gi, ai], ■ ■ ■ , Xn := [gn, cr„]|, where Xi’s are mutually distinct 
variable labels in X. The term tree f9, called the instance of / by 9, is obtained 
by applying the all bindings Xi := [gi,<Ji] on / simultaneously. For term trees 
/ and g, if there exists a substitution 9 such that / = g9, we write f ^ g. 
Especially we write / A 5 if / ^ f? and 5 ^ /. In Fig. |2| we give examples of a 
regular term tree, a substitution and an instance. 
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Fig. 2. Ground term trees t\ and t2, and an instance tO which is obtained by applying 
a substitution 6 = {x -.= [ti, [vi,V2]],y '■= {t2, [ui,U2]]} to the regular term tree t. A 
variable is represented by a box with lines to its elements. The label of a box is the 
variable label of the variable. 



For a (regular) term tree g, the (regular) term tree language L{g) of g is 
defined as L{g) = {h g QTT \ h = g9 for a substitution 9}. The class TZTTC of 
all regular term tree languages is defined as TZTTL — {L{g) \ g G TZTT}- Let S 
be a nonempty finite subset of QTT and r be a regular term tree. A regular term 
tree language L{r) is minimal for {S,TZTT) if (i) S C L{r) and (ii) L{s) ^ L{r) 
implies S % L{s) for any s G TZTT- 

Minimal Language (MINL) Problem for TZTT 

Instance: A nonempty finite subset S of QTT. 

Question: Find a regular term tree r such that L{r) is minimal for {SffZTT). 



3 Polynomial Time Algorithm for Finding Minimal 
Regular Term Languages 

Since solving the minimal language problem for TZTT is essential to the learn- 
ability of regular term tree languages, we give the following theorem. 

Theorem 1. The minimal language problem for TZTT with infinite edge labels 
is computable in polynomial time. 

Proof. We show that the procedure MINL{S) (Fig. ED works correctly for finding 
a regular term tree r such that the language L(r) is minimal for {S,TZTT). Let 
/ = {Vf,Ef,Hf) and g = {Vg, Eg, Hg) be term trees. We write h fv g if there 
exists a bijection ( ■ Vh ^ Vg such that for u,v G 14, {u, v} G Eh or [u, u] G Hh 
if and only if {C(“)j ?('*^)} G Eg or [^(u),^(u)] G Hg. For TZTT with infinite edge 
labels, the following two claims hold: 

Claim 1. For any g,h G TZTT with hK, g, h < g ii and only if L(h) C L{g). 

Proof of Claim 1. If h < g, we have L(h) C L{g) straightforwardly. Then we 
show that h < g ii L{h) C L{g). Since h k, g, there exists a bijection ^ such 
that for u,v G 14, {u, u} G Eh or [u,u] G Hh if and only if {C(w),^(u)} G Eg or 
[^(it),^(u)] G Hg.li there does not exist a bijection ( such that [^(u),^(u)] is in 
Hg for all [u,u] G Hh, the ground term tree which is obtained by replacing all 
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Procedure MINL{S); 

Input: a nonempty finite set S of ground term trees. 

Output: a regular term tree g such that the language L[g) is minimal for [S,TZTT). 

begin 

g Basic- Tree(S'); 

foreach variable \u, u] £ Hg do 

foreach edge label c which appears in S do begin 
let g' be a term tree which is obtained from g 

by replacing variable [m, u] with an edge labeled with c; 
if S C L{g') then begin g := g'; break end 
end 

end; 



Procedure Basic- Tree(S'); 

begin / / Each variable is assumed to be labeled with a distinct variable label. 
d:=0; g := ({r},0,0); 
g := breadth-expansion(r, g, S); 

max-depth :=the maximum depth of the trees in S; 
d := d 1 ; 

while d < max-depth — 1 do begin 

V :=a vertex at depth d which is not yet visited; 
g :=breadth-expansion(v, g, S); 

while there exists a sibling of v which is not yet visited do begin 
Let v' be a sibling of v which is not yet visited; 
g :—breadtb-expansion(v',g,S) 

end; 

d := d -I- 1 

end; 
return g 
end; 



Procedure breadth-expansion(v , g , S); 

begin 

g' ~depth-expansion(v,g,S)\ 

while g ^ g' do begin 

9 ■■= g'-, 

g' :—depth-expansion{v,g,S) 

end; 
return g 
end; 



Procedure depth-expansion(v, g, S); 

begin 

Let g be (U,,0,B'<,); 

Let v' be a new vertex and 
[w, u'] a new variable; 
g' := {Vg U {v'},%,Hg U {[u,u']}); 
while S C L(g') do begin 
g ■- g'; v := v'; 

Let v' be a new vertex and 
[u, v'\ a new variable; 
g' ■- (Vg U {u'},0,idg U {[u,u']}) 
end; 
return g 
end; 



Fig. 3. MINL{S): An algorithm for finding a regular term tree r such that the language 
L(r) is minimal for {S,TZTT). 
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variables in h with an edge label which does not appear in g is not in L{g). Then 
G Hg for all [u,v] G Hh- Therefore h<g. {End of Proof of Claim 1) 

By Claim 1 we can replace the inclusion relation C on TZTTC with the relation 
^ on TZTT. 

Claim 2. Let g = {Vg, Eg, Hg) be an output regular term tree by the procedure 
MINE, given a nonempty finite set S of ground term trees. If there exists a 
regular term tree h with S C L{h) C L{g), then h g. 

Proof of Claim 2. Let g' be the regular term tree which is obtained by Basic- 
Tree{S). Then L{h) C L{g) C L{g') and g ~ g'. Let h' be the regular term 
tree which is obtained by replacing all edges in h with variables. Then h k, h' 
and L{h) C L{h'). Let 0 be a substitution which realizes h = g'9 and 6' a 
substitution which obtained by replacing all edges appearing in 6 with variables. 
Then h' = g'9'. Since S C L{h) C L{h'), S C L{g'9'). Since Basic-Tree{S) 
generates a regular term tree whose language is minimal for {S, {g G TZ’T'T \ 
g has no edge}), g'9' = g'. Therefore h' = g', then h k, g. {End of Proof of 
Claim 2) 

Suppose that there exists a regular term tree h = {Vh,Eh,Hh) such that 
S C L{h) C L{g). From the above two claims, we obtain h ^ g and h k, g. 
There exist a variable [m, u] G Hg labeled with x and an edge label a such that 
h ^ g{x := [Ta, where Ta is a tree consisting of an edge {u' ,v'} labeled 

with a. We denote by / the regular term tree which is obtained in the procedure 
MINE before trying to replace the variable [u, u] with an edge labeled with a. 
Then we have that S 2 L{f{x '■= [Ta, and there exists a substitution 

9 with g = f9. From h ^ g{x := [Ta, [u',u']]} = f9{x := [Ta, [u',z;']]} = f{x := 
[Ta,[u' ,v']]}9, we have h ^ f{x := [Ta,[u' ,v']]}. Thus S C E{h) C E{f{x := 
[Ta, [it',u']]}). This contradicts S % E{f{x := [Ta, [u',u']]}). 

In jS|, we gave an algorithm for deciding whether or not a given ground 
term tree T is a member of E{g) for a given regular term tree g. The algorithm 
runs in 0{n^N^'^) time where n and N are the number of vertices in g and T 
respectively. Let Nmax and Nmin be the maximum and minimum numbers of 
vertices of ground term trees in S respectively. Since the final regular term tree 
g is no larger than the smallest ground term tree in S, the algorithm MINE{S) 
checks at most 0{Nmin) time whether or not S C E{g). Therefore the algorithm 
runs in time. □ 

4 Hardness Results of Finding Regular Term Trees of 
Minimal Language 

In Section 0 we have given a polynomial time algorithm for finding a regular 
term tree of minimal language from a given sample. In this section, we discuss 
MINL problems with optimizing the size of an output regular term tree. 

MINL with Variable-size Minimization 

Instance: A nonempty finite subset S of QTT and a positive integer K. 



Polynomial Time Algorithms for Finding Unordered Tree Patterns 341 





(xi, X2, X3) = {true, false, true) 



(xi,X2,X3) = {true, false, false) 



Fig. 4. Tj for a clause Cj = {xi,X 2 ,X 3 }. Pi (1 < i < 7) is constructed systematically 
by removing appropriate 3 branches labeled with T or F from Pq. We describe only Pi 
and p 2 in this figure. 




T T' 



Fig. 5. Two special sample trees T and T' . 

Question: Is there a regular term tree t = {V,E,H) with \H\ < K such that 
L{t) is minimal for {S,TZTT)‘! 

Theorem 2. MINL with Variable-size Minimization is NP-complete. 

Proof. Membership in NP is obvious. We transform 3-SAT to this problem. Let 
U = {x\, . . . ,Xn} be a set of variables and C = {ci, . . . ,Cm} be a collection of 
clauses over U such that each clause (1 < j < m) has \cj\ = 3. We use symbols 
T, F, di, c? 2 , (^ 4 , ds, xi, . . . , Xn as edge labels. 

We construct trees Ti, ... , Tm from ci, . . . , c^. Let Pq be the tree which is 
described in Fig. 0 For a clause Cj, which contains Xj.^,Xj.^,Xj^ as positive or 
negative literals, we have the 7 truth assignments to Xjj^,Xj 2 ,Xj^ which satisfy 
the clause cj. For the ith truth assignment {xjj^,Xj^,Xj^) = (6ii, (* = 
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Fig. 6. The regular term trees which generate minimal languages for {31,(72} 

where both 31 and 32 have the same edge label Xi. 




Fig. 7. The regular term trees which generate minimal languages for {31,32} 

where 31 and 32 have distinct edge labels Xi and Xj {i ^ j), respectively. 



1,2,..., 7), we construct Pi by removing the branches which are labeled with 
hi , bi2 , bi3 from the subtrees corresponding to Xj^ , Xj^ , xj^ of Pq , respectively. For 
example, for a clause {cci, T2, 2:3} the tree Pi in Fig. 2]shows an assignment xi = 
true,X2 = false, X3 = true and the tree P2 shows Xi = true,X2 = false, X3 = 
false. We also construct two special trees T and T' (Fig.0. T and T' have 7 
subtrees like Tj. Only one subtree of them is distinct. Let S = {Ti, . . . , Tm, T, T'} 
be a sample set. Lastly we set K = In. The depth of a vertex v is the length of 
the unique path from the root to v. 

Fact 1. For any regular term tree g which is minimal for (S,TZPP), (i) the root 
of g has just 7 children which connect to the root by edges labeled with di, (ii) 
each vertex of depth 1 of 3 has just n children which connect to the parent by 
edges labeled with d2, and (iii) each vertex of depth 2 of 3 has just one child. 

This fact means that any regular term tree 3 which is minimal for (S,TZPP) 
has at least 7n variables each of which locates at each subtree rooted at a vertex 
of depth 3. 

Fact 2. Let 31 and 32 be trees described in Fig. 0 There are only three regular 
term trees which are minimal for ({31, 32}, 72.TT). The three regular term trees 
are described in Fig.0 



Polynomial Time Algorithms for Finding Unordered Tree Patterns 343 




Fig. 8. The output regular term tree when there is a truth assignment which satisfies 
C (Theorem 0. 



Fact 3. Let g[ and g '2 be trees described in Fig. Q There are only three regular 
term trees which are minimal for ({ 51 , 32 })^"^"^)- The three regular term trees 
are described in Fig.|3 

From the above facts, if 3-SAT has a truth assignment which satisfies all 
clauses in C, there is a regular term tree t = (F, E, H) with \H\ = In such that 
L{t) is minimal for TZET) (Fig. E)- Conversely, if there is a regular term tree 
t = {V, E, H) with \H\ = In such that L(t) is minimal for {S, TZET), the regular 
term tree is isomorphic to the one which is described in Fig. 0 by ignoring edge 
labels T and F. For 1 < z < n, we assign true to Xi if the nearest edge label from 
Xi of type T,F is T, otherwise we assign false to Xi- Then this truth assignment 
satisfies C. □ 

Next we show that it is hard to compute the regular term tree of maximum 
tree-size which is minimal for (S,TZEE) for a given sample set S. 

MINL with Tree-size Maximization 

Instance: A nonempty finite subset S of QTT and a positive integer K. 

Question: Is there a regular term tree t = (F, E, H) with |F| > AT such that 

L{t) is minimal for {S,TZTT)‘l 

Theorem 3. MINL with Tree-size Maximization is NP-complete. 

Proof. (Sketch) Membership in NP is obvious. We transform 3-SAT to this prob- 
lem in a similar way to Theorem |21 We use only blank as an edge label. Each 
variable Xi {1 < i < n) transforms a tree of the form shown in Fig. 0 It is easy 
to see that the number of vertices of a regular term tree which matches the trees 
corresponding to Xi and Xj {i 7 ^ j) is at most 5(n + 1). 

We construct trees Ti , . . . , Tm from ci , . . . , Cm in a similar way to Theorem 0 
but we use subtrees Pi, . . . (Pi is shown in Fig.ITTIl instead of Pi, ... in Fig. 0 
Moreover we construct one special tree T (Fig. EJ. Let S = {Pi, . . . , Tm, T} be 
a sample set of QTT. Lastly we set K = 35n^ + 92n + 8. Then, we can compute 
a regular term tree t = (V,E,H) with \H\ = K such that L{t) is minimal for 
{s,nTr) (Fig.Ei) if and only if 3-SAT has a truth assignment which satisfies 
all clauses in C. □ 
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Fig. 9. Subtrees corresponding to Xi, . . . ,Xn- 





Fig. 10. A subtree Pi corresponding to a truth assignment and a special sample tree 
T. 




Fig. 11. The output regular term tree when there is a truth assignment which satisfies 
C (Theorem E|) . 



Polynomial Time Algorithms for Finding Unordered Tree Patterns 345 



5 Polynomial Time Inductive Inference of Regular Term 
Tree Languages from Positive Data 

In this section, a language is a subset of QTT. An indexed family of reeursive 
languages (or simply a class) is a family of languages C = {Li, L 2 , • ■ ■} such that 
there is an effective procedure to decide whether w € Li or not, given a tree w 
and an index i. In our setting, an index is a term tree, and the language Li with 
an index i is the term tree language L(i) of a term tree i. A class C is said to 
have finite thickness, if for any nonempty finite set T C Q'T'T, the cardinality of 
{L G C \ T C L} is finite. 

An inductive inference machine (IIM, for short) is an effective procedure 
which requests inputs from time to time and produces indices as hypotheses from 
time to time. A positive presentation cr of a nonempty language L is an infinite 
sequence W\,W 2 , - ■ ■ of trees in QTT such that {wi,W 2 , • • •} = L. We denote by 
cr[n] the cr’s initial segment of length n > 0. For an IIM and a finite sequence 
cr[n] = wi,W 2 , ■ ■ ■ ,Wn, we denote by M(a[n]) the last hypothesis produced by 
M which is successively presented wi,W 2 , ■ ■ ■ , Wn on its input requests. An IIM 
is said to converge to an index r for a positive presentation a, if there is an n > 1 
such that for any m > n, M{a[m]) is r. Let C be a class and M be an IIM for 
C. An IIM M is said to infer a class C in the limit from positive data, if for any 
L G C and any positive presentation a oi L, M converges to an index r for cr 
such that L = Lr- A class C is said to be inferable in the limit from positive data, 
if there is an IIM which infers C in the limit from positive data. 

A class C is said to be polynomial time inductively inferable from positive 
data if there exists an IIM for C which outputs hypotheses in polynomial time 
with respect to the length of the input data read so far, and infers C in the limit 
from positive data. 

Theorem 4 ([2j,[lUj). Let C be a class. IfC has finite thickness, and the mem- 
bership problem and the minimal language problem for C are computable in poly- 
nomial time, then C is polynomial time inductively inferable from positive data. 

The membership problem for TZTTC is, given a ground term tree g and a 
regular term tree t, the problem of deciding whether or not g G Lft). We gave a 
polynomial time algorithm for the membership problem for TZTTC, pj . Since the 
class TZTTC has finite thickness |S|, from Theorem I I 141 we have Theorem O 

Theorem 5. The class TZTTC with infinite edge labels is polynomial time induc- 
tively inferable from positive data. 

6 Conclusions 

The minimal language problem is a kernel problem in learning methods such as 
inductive inference. We have given a polynomial time algorithm for solving the 
minimal language problem for regular term trees. From the viewpoint of compu- 
tational complexity, we have shown that it is hard to solve the minimal language 
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problems with optimizing the size of an output regular term tree. By using our 
polynomial time algorithm, we have shown the regular term tree language with 
infinite edge labels is polynomial time inductively inferable from positive data. 

We can give membership and MINL algorithms to the following two classes 
of regular term trees with no edge label: VTTCi = {L{g) \ g G TZTT and the 
number of children of every vertex in g is not 2} and TZTTC^ = {L{g) \ g € TZ’T'T 
and (i) for every pair of vertices in t whose degrees are more than 2, there exists 
a vertex of degree 2 on the path between them, and (ii) there is no vertex of 
degree 3 in t such that the distance between any leaf and the vertex is at least 
2}. Therefore the classes are polynomial time inductively inferable from positive 
data. But it is an open question whether or not the class of all regular term 
tree languages with no edge label is polynomial time inductively inferable from 
positive data. 
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Abstract. The necessary and sufficient conditions for an automaton to 
be locally threshold testable are found. We introduce the polynomial 
time algorithm to verify local threshold testability of the automaton of 
time complexity 0(u®) and an algorithm of order 0(n®) for the local 
threshold testability problem for syntactic semigroup of the automaton. 
We modify necessary and sufficient conditions for piecewise testability 
problem for deterministic finite automaton and improve the Stern algo- 
rithm to verify piecewise testability for the automaton. The time com- 
plexity of the algorithm is reduced from 0(n®) to O(n^). An algorithm 
to verify piecewise testability for syntactic semigroup of the automaton 
of order O(n^) is presented as well. 

The algorithms have been implemented as a package. 

Keywords: automaton, locally threshold testable, piecewise testable, 
locally testable, transition graph, syntactic semigroup, algorithm 



Introduction 

The concept of local testability was introduced by McNaughton and Papert El 
and by Brzozowski and Simon 0. Local testability can be considered as a special 
case of local ^-threshold testability for Z = 1. 

Locally testable automata have a wide spectrum of applications. Regular 
languages and picture languages can be described by help of a strictly locally 
testable languages PI, PI Local automata (a kind of locally testable automata) 
are heavily used to construct transducers and coding schemes adapted to con- 
strained channels |p. Locally testable languages are used in the study of DNA 
and informational macromolecules in biology |B|. 

Kim, McNaughton and McCloskey nm have found necessary and sufficient 
conditions of local testability and a polynomial time algorithm for local testa- 
bility problem based on these conditions. The realization of the algorithm is 
described by Caron jOj. A polynomial time algorithm for local testability prob- 
lem for semigroups was presented in nn. 

The locally threshold testable languages were introduced by Beauquier and 
Pin j2j . These languages generalize the concept of locally testable language and 
have been studied extensively in recent years. An important reason to study 
locally threshold testable languages is the possibility of being used in pattern 
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recognition m- Stochastic locally threshold testable languages, also known as 
n — grams are used in pattern recognition, particular in speech recognition, both 
in acoustic-phonetics decoding as in language modeling m- 

For the state transition graph F of an automaton, we consider some sub- 
graphs of the cartesian product F x F and F x F x F. In this way, necessary and 
sufficient conditions for a deterministic finite automaton to be locally threshold 
testable are found. It gives a positive answer on Caron’s question [Zj. We present 
here O(n^) time algorithm to verify local threshold testability of the automaton 
based on this characterization. By n is denoted here the sum of the nodes and 
edges of the graph of the automaton (n can be also considered as the product 
of the number of states by the size of the alphabet). 

The local threshold testability problem for semigroup is, given a semigroup, 
to decide, if the semigroup is locally threshold testable or not. We present a 
polynomial time algorithm for this problem of order O(n^). By n is denoted 
here the size of the semigroup. 

A language is piecewise testable iff its syntactic monoid is /-trivial uni- 

Stern HH modified these necessary and sufficient conditions and described a 
polynomial time algorithm to verify piecewise testability of deterministic finite 
automaton of order O(n^). The algorithm was implemented by Caron j0|. Our 
aim is to reduce the last estimation. 

We modify necessary and sufficient conditions m, d for the piecewise 
testability problem and describe an algorithm to verify piecewise testability of 
deterministic finite automaton and of his syntactic semigroup of order 0{n^). 

Necessary and sufficient conditions of local testability m are considered in 
this paper in terms of reachability in the graph F x F. New version of 0{n^) time 
algorithm to verify local testability based on this approach will be presented too. 

The considered algorithms have been implemented as a part of C/ pack- 
age TESTAS (testability of automata and semigroups). 



Notation and Definitions 

Let E be an alphabet and let 17+ denote the free semigroup on E.lfwG F7+, let 
|w| denote the length of w. Let A: be a positive integer. Let ik{w) [tk{w)] denote 
the prefix [suffix] of w of length fc or re if \w\ < k. Let Fkj{w) denote the set of 
factors of w of length k with at least j occurrences. A language L [a semigroup 
S'] is called 1-threshold k-testable if there is an alphabet E [and a surjective 
morphism </> : 17+ — >■ S] such that for all u, v € 17+, if ik-i{u) = ik-i{v), 
tk-i{u) = tk-i{v) and F}^ j(u) = Ff^ j(v) for all j < I, then either both u and v 
are in L or neither is in L [u(j) = vfj)] ■ 

An automaton is ^-threshold fc-testable if the automaton accepts a l- 
threshold fc-testable language [the syntactic semigroup of the automaton is l- 

threshold fc-testable] . 

A language L [a semigroup, an automaton] is locally threshold testable 
if it is /-threshold /c-testable for some k and 1. 
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Piecewise testable languages are the finite boolean combinations of the lan- 
guages of the form A*a\A*a 2 A* ...A*akA* where /c > 0, Oi is a letter from the 
alphabet A and A* is a free monoid over A. 

|T| denotes the number of nodes of the graph F. 

T* denotes the direct product of i copies of the graph T. 

The edge p^, ^ qi, q„ in T* is labeled by a iff for each i the edge 
Pi — >• qi in T is labeled by a. 

A maximal strongly connected component of the graph will be denoted for 
brevity as SCC, a finite deterministic automaton will be denoted as DFA. 
Arbitrary DFA is not necessary complete. A node from an SCC will be called 
for brevity as an SCC — node. SCC — node can be defined as a node that has 
a right unit in transition semigroup of the automaton. 

The graph with trivial SCC is called acyclic. 

If an edge p — >■ q is labeled by a then let us denote the node q as per. 

We shall write p h Q if the node q is reachable from the node p or p = q. 

In the case p h Q and q ^ p we write p q (that is p and q belong to one 
SCC or p = q). 

The stabilizer A(q) of the node q from F is the subset of letters a G S 
such that any edge from q labeled by cr is a loop q — >■ q. 

Let F{Si) be the directed graph with all nodes from the graph F and edges 
from F with labels only from the subset Si of the alphabet S. 

So, F(A’(q)) is a directed graph with nodes from the graph F and edges from 
F that are labeled by letters from stabilizer of q. 

A semigroup without non-trivial subgroups is called aperiodic j^. 

Let p be a binary relation on semigroup S such that for a,b G S apb iff for 
some idempotent e G S ae = a, be = b. Let A be a binary relation on S such 
that for a,b G S aXb iff for some idempotent e G S ea = a, eb = b. 

The unary operation a;“ assigns to every element a; of a finite semigroup S 
the unique idempotent in the subsemigroup generated by x. 



1 The Necessary and Sufficient Conditions of Local 
Threshold Testability 

Let us formulate the result of Beauquier and Pin |2| in the following form: 

Theorem 11 m A language L is locally threshold testable if and only if the 
syntactic semigroup S of L is aperiodic and for any two idempotents e, f and 
elements a, u, b of S we have 



eafuebf = ebfueaf 



( 1 . 1 ) 



Lemma 12 Let the node (p, qj be an SCC-node of F^ of a locally threshold 
testable DFA with state transition graph F and suppose that p ~ q. 

Then p = q. 
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Proof. The transition semigroup S of the automaton is finite and aperiodic m 
Let us consider the node (p, q) from SCC X of T^. Then for some element e G S 
we have qe = q and pe = p. In view of qe* = q, pe® = p and finiteness of S 
we can assume e is an idempotent. In the SCC X for some a, b from S we have 
pa = q and q6 = p. Hence, peae = q, qe6e = p. So peae6e = p = p(eae6e)® 
for any integer i. There exists a natural number n such that in the aperiodic 
semigroup S we have (eae)" = (eae)"+^. From theorem ITTl it follows that for 
the idempotent e, eaeebe = ebeeae. We have p = peae6e = p(eaee6e)" = 
p(eae)"(e&e)" = p(eae)"+^(e6e)" = p(eae)”(e6e)”eae = peae = q. So p = q. 

Theorem 13 For DF A A with state transition graph F the following three 
conditions are equivalent: 

1 )A is locally threshold testable. 

2) If the nodes fp,q 2 ,ri^ and (^q,r,t^,t^ are SCC-nodes of F^ and F'^ , cor- 
respondingly, and 

(q,r) ^ (qiTi), (p,qi) ^ (r,t), (p,ri) ^ (q,ti) holds in F^ 
then t = ti . 

3) If the node (w, w) is an SCC-node of the graph F^ and w ~ v then w = v. 
If the nodes (p,q 2 ,ri^, fq,r,t^, ('qjTjtj) are SCC-nodes of the graph F^ 

and 

(q,r) h (qi,n), (p,qi) ^ (r,t), (p,ri) ^ (q,ti) hold in F"^ , 
then t ^ ti . 



2 ) 



3 ) 




ti = t 




Let us consider the nodes zobfueaf and zeafuebf where z is an arbitrary 
node of F, a, u, b are arbitrary elements from transition semigroup S of the 
automaton and e, / are arbitrary idempotents from S. Let us denote 

ze = p, zebf = q, zea/ = r, zeafue = ri, zebfue = qi, zebfueaf = t, 
zeafuebf = ti. 

By condition 2), we have t = ti, whence zebfueaf = zeafuebf. Thus, the 
condition eafuebf = ebfueaf mu holds for the transition semigroup S. By 
theorem o the automaton is locally threshold testable. 

1) -)> 3): 

If the node (w, v) belongs to some SCC of the graph F^ and w ~ v then by 
lemmafT^local threshold testability implies w = v. 

The condition eafuebf = ebfueaf 1 II I . I II . theorem II III holds for the tran- 
sition semigroup S of the automaton. Let us consider nodes p, q, r, t, qi, ri, ti 
satisfying the condition 3). Suppose 
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(p,qi,ri)e = (p,qi,ri), (q,r,t)/2 = (q,r,t), (q,r,ti)/i = (q,r,ti) 
for some idempotents e, /i, /2 G S, and 

(p,qi)a= (r,t), {p,r^)b= (q,ti), (q,r)u= (qi,ri) 
for some elements a,b,u € S. Then pea/2 = pea/i and pebf2 = pebfi. 

We have ti/2 = peafiuebfif2- By theorem^ pebfjueafj = peafjuebfj for 
j = 1 , 2 . So we have ti/2 = peafiuebfif2 = pebfiueafif2- In view of pebf2 = 
pebfi and ft = fif^ we have ti/2 = pebf2f2ueafif2- By theorem HU, ti/2 = 
Pe{bf2)f2ue{afi)f2 = pe(a/i)/2Me(6/2)/2- Now in view of pea/2 = pea/i let 
us exclude /i and obtain ti/2 = pea/2 ^06/2 = t. So 1 1/2 = t. Analogously, 
t/i = ti. 

Hence, ti ~ t. Thus 3 ) is a consequence of 1 ). 

3 ) -l- 2 ): 

Suppose that (p,qi,ri)e = (p,qi,ri), (q,r,t,t;i)/ = (q,r,t,ti), for some 
idempotents e, / from transition semigroup S of the automaton and 
(p,qi)a= (r,t), {p,ri)b= (q,ti), (q,r)u= (qi,ri) 
for some elements a, u, b € S. Therefore 

(p, qi)ea/ = (p, qi)a/ = (r, t) 

(p,ri)e&/ = {p,ri)bf = (q,ti) 

(q,r)u = (q,r)/we = (qi,ri) 
for idempotents e, / and elements a, u, b € S. 

For / = /i = /2 from 3 ) we have t ^ ti. Notice that (ti,t)/ = (ti,t). The 
node (ti, t) belongs to some SCC of the graph and t ~ ti, whence by lemma 

im t = ti. 



Lemma 14 Let the nodes (q,r,tj^^ and (q,r,t 2 j be SCC-nodes of the graph 
of a locally threshold testable DFA with state transition graph F. Suppose 
that (p,ri) ^ (q,tj), (p,ri) h (q,t 2 ) in the graph and p h r ^ fi- 
Then ti ^ t2. 




7i 

Proof. Suppose that the conditions of the lemma hold but ti / t2. 

We have (p,ri)e = (p,ri), (q,r,ti)/i = (q,r,ti), (q,r,t2)/2 = (q,r,t2), for 
some idempotents e, /2, /2 from the transition semigroup S of the automaton 
and 



(p,ri) 5 i = (q,ti), (p,ri)62 = (q,t2), pa = r, rw = ri 
for some elements a, u, bi, 62 € S. 

If ti/2 ^ t2 and 12/1 ^ ti then t2 ti in spite of our assumption. Therefore 
let us assume for instance that ti 7^ 12/1. (And so ti yf t2/i). This gives 
us an opportunity to consider 12/1 instead of t2. So let us denote t2 = 12/1, 
f = fi = /2- Then 12/ = t2, tif = ti and ti t2. Now 
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pea/ = pa/ = r 

(p,ri)e6i/ = (p,ri)6i/ = (q,ti) 

(p,ri)e&2/ = {p,r^)b2f = (q,t2) 
ru = rue = ri 

Let us denote qi = que and t = qia/i. Then 
(p,qi,ri)e = (p,qi,ri), (q,r)ue = (qi,ri), (q,r,t,tj/ = (q,r,t,tj 
So the node (p, qi, ri) is an S'C'C'-node of the graph the nodes (p, q, r, tj) 
are S'C'C'-nodes of the graph F'^ for i = 1,2 and we have (q, r) ^ (qi,ri), 
(P,qi) ^ (r,t) and (p,r^) F (q,!^) for i = 1,2. 

Therefore, by theorem E] (2), we have ti = t and t 2 = t. Hence, ti ^ t 2 , 
contradiction. 



Definition 15 For any four nodes p,q,r,rj^ of the graph F of a DFA such 
that p ^ r ^ ri, p ^ q and the nodes (p,v-y), (q_,v) are SCC- nodes, let 
Tscc{p,Q.AAi) be the SCC of F containing the set 

T(p,q,r,r^) := {t |(p,r^) ^ (q,t) and fq,r,tj is an SCC-node} 




t G Tscc(p,q,r,r-i) 



In virtue of lemma El the SCC Tscc{p,<iAAi) of ^ locally threshold 
testable DFA is well defined (but empty if the set T(p, q, r,rj^) is empty). 
Lemma m and theorem ini(3) imply the following theorem 



Theorem 16 A DFA A with state transition graph F is locally threshold 
testable iff 

1 )for every SCC-node (p, qj of F"^ P q implies p = q 
and 

2)for every five nodes p,q,r,qj^,ri of the graph F such that 

— the non-empty SCC r 5 cc(P 7 qj r, rj^) and Tscc{PAi<ij^.i) exist, 

— the node (p, q^jTi^ is an SCC-node of the graph F^, 

— (q,r) b (qi,ri) in F'^, 

holds Tsc’c(p,q,r,ri) = ?scc(p, r, q, qj- 




7scc(p,q,r,ri) 



Tscc(P7r,q, qi) 



Let us go to the semigroup case. 
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Lemma 17 Elements s,t from semigroup S belong to subsemigroup eSf where 
e and f are idempotents if and only if spt and sXt. 

The proof follows from the definitions of p and A. 

Theorem 18 A language L is locally threshold testable if and only if the syn- 
tactic semigroup S of L is aperiodic and for any three elements s, u, t of S such 
that spt and sXt we have 

sut = tus (1-2) 

The proof follows from theorem and lemma ^3 

2 An Algorithm to Verify Local Threshold Testability of 
DFA 

A linear depth-first search algorithm finding all SCC (see lEI) will be used. 

By n will be denoted the sum of the nodes and edges of the graph. 



2.1 To Check the Reachability on an Oriented Graph 

For a given node qo, we consider depth-first search from the node. First only qo 
will be marked. Every edge is crossed two times. Given a node, the considered 
path includes first the ingoing edges and then the outgoing edges. After crossing 
an edge in the positive direction from the marked node q to the node r we mark 
r too. The process is linear in the number of edges (see m for details). 

The set of marked nodes forms a set of nodes that are reachable from qo. 
The procedure may be repeated for any node of the graph G. 

The time of the algorithm for all pairs of nodes is O(n^). 

2.2 To Verify Local Threshold Testability of DFA 

Let us find all SCC of the graphs E, E^ and E^ and mark all S'C'C'-nodes (O(n^) 
time complexity). 

Let us recognize the reachability on the graph E and E^ and form the table 
of reachability for all pairs of E and E^. The time required for this step is O(n^). 

Let us check the conditions of lemma El For every S'C'C-node (p, q) (p yf q) 
from E^ let us check the condition p ^ q. A negative answer for any considered 
node (p, q) implies the validity of the condition. In opposite case the automaton 
is not locally threshold testable. The time of the step is O(n^). 

For every four nodes p,q,r,r^ of the graph E, let us check the following 
conditions (see EJ: P r ^ ri and p ^ q. In a positive case, let us form 
SCC Tsc'c(p, q, r,r^) of all nodes t G E such that (p,r^) ^ (q, t) and (q,r,t) 
with (p,r^^) are S'C'C'-nodes. In case that SCC Tscc is not well defined the 
automaton is not threshold testable. The time required for this step is O(n^). 

For every five nodes p, q, r, qj^, ri from E we check now the second condition 
of theorem rm If non-empty components Tscc(p, Qj r, r;^) and Tscc(p, r, q, qi) 
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exist, the node (p, Qi, ri) is an S'CC'-node of the graph and (q, r) ^ (qi, ri) in 
-T^, let us verify the equality T 5 C'c(p, Q, r, r^) = Tsc'c(p, r, q, q^). If the answer 
is negative then the automaton is not threshold testable. A positive answer for 
all considered cases implies the validity of the condition of the theorem. The 
time required for this step is 0{n^). 

The whole time of the algorithm to check the local threshold testability is 
0(n5). 

3 Verifying Local Threshold Testability of Finite 
Semigronp 

The algorithm is based on the theorem ITRl 

Let Si be an element of the semigroup S, n = [S'!. 

— For any s € S let us find and check the aperiodity. The semigroup S is 
not locally threshold testable for non-aperiodic S. 

— Let us form a binary square table L [i?] of the size n in the following way: 
For any i,j < n suppose = 1 [i?ij = 1 ] if there exists an idempotent 
e € S such that esi = Si and esj = Sj [ s^e = Si and sje = Sj ]. In opposite 
case Lij [Rij] = 0. 

The step has order 0{rfi). 

— Let us find the intersection pH A and form a binary square table LR\ LRi j = 

■ 

— For any triple Si,Sj,Sk G S where Si{p C\ A)sj, let us check the condition 
SiSkSj = SjSkSi- The validity of the condition for any triple of elements 
implies local threshold testability of S. In opposite case S is not locally 
threshold testable. 

The step has order 0{n^). 

The algorithm to verify local threshold testability of the semigroup S has order 
0(n3). 

4 An Algorithm to Verify Local Testability of DFA 

We present now necessary and sufficient conditions of local testability of Kim, 
McNaughton and McCloskey (^0]) in the following form: 

Theorem 41 fITTy) A DFA with state transition graph F and transition semi- 
group S is locally testable iff the following two conditions hold: 

1 )For any SCC-node (p, qj from F^ such that p ~ q we have p = q. 

2)For any SCC-node from F^ such that p q and arbitrary element 

s from S we have ps ^ q is valid iff qs ^ q. 



The theorem implies 
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Corollary 42 A DFA with state transition graph F over alphabet S is locally 
testable iff the following two conditions hold: 

1) For any SCC-node (p, qj from F^ such that p ~ q we have p = q. 

2) For any node (v,s) and any SCC-node (p,qj from F^ such that (p, q) ^ 
(r,s), p ^ q, s ^ q and for arbitrary a from S we have ra F s is valid iff 
scr ^ s. 

In j1 (1) . a polynomial time algorithm for local testability problem was con- 
sidered. Now we present another simplified version of such algorithm with the 
same time complexity. 

Let us form a table of reachability on the graph F (0(n^) time complexity). 
Let us find F^ and all ^CC-nodes of F^. 

For every S'C'C'-node (p, q) (p ^ q) from F^ let us check the condition p ~ q. 
time complexity). If the condition holds then the automaton is not locally 
testable m- 

Then we add to the graph new node (0, 0) with edges from this node to every 
^CC-node (p,q) from such that p h q- Let us consider first-depth search 
from the node (0,0). 

We do not visit edges (r,s) — >• (r,s)cr from the graph such that str ^ s 
and rcr ^ s. In the case that from the two conditions scr ^ s and rcr ^ s only 
one is valid the algorithm stops and the automaton is not locally testable. 

In the case of absence of such cases on the path the automaton is locally 
testable. (O(n^) time complexity). 

The whole time of the algorithm to check the local testability is O(n^). 

5 Piecewise Testability 

The following result is due to Simon: 

Theorem 51 m Let L be a regular language over the alphabet E and let F 
be the minimal automaton accepting L. The language L is piecewise testable if 
and only if the following conditions hold 

(i) F is a directed acyclic graph; 

(ii) for any subset Ei from alphabet E each connected component of the graph 
F{Ei) has a unique maximal state. 



Lemma 52 Let the state transition graph F of some DFA be acyclic. Suppose 
that for some subset Ei of the alphabet E the graph F{Ei) has two distinct 
connected maximal nodes. 

Then for some node p from F the graph F(E(p)) has also two distinct con- 
nected maximal nodes where the node p is one of these maximal nodes. 

Proof. Suppose that the states q and p are distinct maximal nodes in some 
connected component X of the graph F{Ei). The graph T of a piecewise testable 
deterministic finite automaton has only trivial SCC ftheorem tblll . whence p q 
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or q ^ p. Suppose that q p and let us consider the graph r{S{p)). r{Si) CL 
-r(Z’(p)), whence the node p is a maximal node of the graph /^(Z’(p)). The 
states q and p are connected in the graph T(T'(p)) too. The node q or some 
successor of q is a maximal node in the same component of T(T'(p)) and this 
maximal node is not equal to p because q p. 

Last lemma gives us opportunity to present necessary and sufficient condi- 
tions for piecewise testability in the following form: 

Theorem 53 Let L he a regular language over the alphabet U and let F he 
a minimal automaton accepting L. The language L is piecewise testable if and 
only if the following conditions hold 

(i) F is a directed acyclic graph; 

(ii) for any node p the maximal connected component C of the graph T(i7(p)) 
such that p € C has a unique maximal state. 

Let us formulate the Simon’s result in the following form: 

Theorem 54 . Finite semigroup S is piecewise testable iff S is aperiodic and 
for any two elements x, y € S holds 

{xyYx = y{xyY = {xyY 

6 An Algorithm to Verify Piecewise Testability for 
Automata 

— Check that the graph F is acyclic. If not then the automaton is not piecewise 
testable and the procedure stops. 

— For any state p, let us compute 

• the stabilizer T’(p). 

• the graph T(L'(p)). 

The node p is a maximal node of some SCC of the graph T(L'(p)) and 
our aim is to find some more maximal nodes of the component or to 
prove their absence. 

• non-oriented copy N of the graph F{S(j))) and maximal connected com- 
ponent C of N that contains p. A linear depth-first search algorithm is 
used for to find C. 

• for any node r (r yf p) from C let us do the following: 

If r has no outgoing edges in the graph T(T’(p)) or all its outgoing edges 
are loops then the automaton is not piecewise testable and the procedure 
stops. 



Theorem 61 Let L he a regular language over the alphabet F and let F be a 
minimal automaton accepting L. The considered algorithm to verify the piecewise 
testability of the language L has order 0{n^) where n is the sum of nodes and 
edges of the graph F. 
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Proof. The number of graphs T(if(p)) we consider is not greater than the 
number of nodes of the graph F. The process of finding of the graph T(d7(p)) 
is linear in the number of nodes of the graph F. 

Depth-first search algorithm is linear in n too. 

The process of excluding loops in the graph T(I7(p)) is linear in the number 
of nodes of T(T'(p)). 

The finding of node without outgoing edges is linear in the number of nodes. 



7 An algorithm to Verify Piecewise Testability for 
Semigroups 

The algorithm is based on the theorem El 

— For any x S S' let us find x'^ equal to x® such that x® = x*+^. If is not 
found then the semigroup S is not piecewise testable. 

For given element x, the step is linear in the size of semigroup. 

— For any pair x,y € S, let us check the condition {xy)‘^x = y(xt/)“ = {xy)‘^. 
The validity of the condition for any pair of elements implies piecewise testa- 
bility of S. In opposite case S is not piecewise testable. 

The algorithm under consideration has order 0{n^) where n = |S|. 
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Abstract. The paper attempts a systematic study of homomorphisms 
of relational strnctures. Such structures are modeled as multialgebras 
(i.e., relation is represented as a set-valued function). The first, main, 
result is that, under reasonable restrictions on the form of the definition 
of homomorphism, there are exactly nine compositional homomorphisms 
of multialgebras. Then the comparison of the obtained categories with 
respect to the existence of finite limits and co-limits reveals two of them 
to be finitely complete and co-complete. Without claiming that compo- 
sitionality and categorical properties are the only possible criteria for 
selecting a definition of homomorphism, we nevertheless suggest that, 
for many purposes, these criteria actually might be acceptable. For such 
cases, the paper gives an overview of the available alternatives and a 
clear indication of their advantages and disadvantages. 

1 Background and Motivation 

In the study of universal algebra, the central place occupies the pair of “dual” no- 
tions of congruence and homomorphism: every congruence on an algebra induces 
a homomorphism into a quotient and every homomorphism induces a congruence 
on the source algebra. Categorical approach attempts to express all (internal) 
properties of algebras in (external) terms of homomorphisms. When passing to 
relational structures or power set structures, however, the close correspondence 
of these internal and external aspects seems to get lost. 

The most common, and natural, generalisation of the definition of homomor- 
phism to relational structures says: 

Definition 1.1 A set function 4> '■ A ^ where both sets are equipped with 
respectiue relations C A” and C B", is a (weak) homomorphism iff 

{xi...Xn) e => {4>{xi)...4>{Xn)) G R^ 

* The first author gratefully acknowledges the financial support from the Norwegian 
Research Council. 

^ Underlying sets will be used to indicate the “bare, unstructured sets” as opposed 
to power sets or other sets with structure. For the moment, one may ignorte this 
notational convention. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 359-Eni 2001. 
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With this definition any equivalence on A gives rise to a weak homomorphism 
and, conversely, a weak homomorphism induces, in general, only an equivalence 
relation on A. Hence this homomorphism does not capture the notion of congru- 
ence and this is just one example of an internal property of relational structures 
that cannot be accounted for by relational homomorphisms (in various variants) . 
Probably for this reason, the early literature on homomorphisms of relations is 
extremely meagre and most work on relations concerns the study of rela- 

tion algebras, various relational operators and their axiomatizations. Although 
in recent years several authors begun studying relational structures and their 
homomorphisms in various contexts, a general treatement of relational homo- 
morphisms is still missing. This growing interest is reflected in numerous sug- 
gestions on how the definition of such a homomorphism could be specialized to 
obtain a more useful notion. This issue is our main objective. 

In a more concise, relational notation, definition o is written as 4> C 
4>; . This somehow presupposes that i? is a binary relation, since composition 

_ has a standard definition only for binary relations. There seems to be no 
generally accepted definition of composition of relations of arbitrary arities. In 
the following we will compose arbitrary relations (within the structures), like 
R above, with binary relations (obtained from homomorphisms between the 
structures), according to the following definition. 

Definition 1.2 The composition of relations R^ C , resp. R^ C , 

with a binary relation (f Q Ax B as a relation on A" x If, is given by: 

(ai...a„, b) G R^\ <() 3a G A : (ai...a„, a) G R^ A (a, b) G 4> 

{ai...an,b) G 4>; & B: {bi...b„,b) G R^ A {ai,bf} G 4> 



This definition is certainly not the only possible one - contain more general 
suggestions. The reason for this choice is our intension to treat relations in an 
algebraic way. It allows us to view relations as set-valued functions and turns 
relational structures into algebraic ones (algebras of complexes from jl 611 6) 1. In 
particular, it admits composition of relations of arbitrary arities analogous to 
composition of functions. 

Now, table ^ presents a sample of proposed definitions of relational homo- 
morphisms gathered from |22IHI7I2()I25I3I23I24| . It uses binary relations but with 
the above definition o it may be used for relations R of arbitrary arity. The 
names are by no means standard and taken from the articles introducing the 
respective definitions. 

This paper is an attempt to bring at least some order into this situation which 
we experience as rather unsatisfactory. Given the combinatorial possibilities of 
defining homomorphisms of relational structures, a complete classification seems 
hardly possible. Even the very issue of the “criteria of usefulness”, depending 
on the intended applications, may be debatable. We hope that a more algebraic 
perspective may bring at least some clarification. Instead of listing and defending 
new definitions, we have chosen the compositionality of homomorphisms and the 
elementary properties of the resulting categories as the basis for comparison. 
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Table 1. Some definitions of relational homomorphisms 



homomorphism 4> 


relational def. 


logical def. Vi, y : 


1. 


weak 




R^{x,y) => R^{<j>{x),4>(y)) 


2. 


loo.se 


R^-.PC 


1. 


3. 


full 


4>^-,R^\<p = p--,<p-, R’^;4>-\(t> 


3x',y' : R'^{x’,y’) R^ (4,{x) , 4>(y)) 


4. 


‘strong' 


4>-;R^-,4,D R’^;4>^\4> 


3x',y' : R^(x',y') <= R^ {(p{x) , <j>{y)) 


5. 


outdegree 


R^-4, = <j>\ 


3x' ■.R^{x',y)z^R’^(4>(x),4>[y)) 


6. 


indegree 


4>~\R-'^ = 4>~;p-,R^;4>- 


3y' : R^{x,y') R^ [<j}{x) , 4>{y)) 


7. 


‘very strong^ 


<j>\ 2 <l>\ 


3x',y' : R^{x',y') R.^ {(j>{x),y) 


8. 


regular 


5. & 6. 


5. & 6. 


9. 


closed 


R^-.4>^ 


3y' : R^{x,y') C= R’^ {(j}{x),y) 


10. 


strong 


R-'^ = 4>\R^\ip- 


R^{x,y) <t4 R^{(ji{x),(l){y)) 


11. 


tight 


-o- 

II 

to 


2. & 9. 



— primed symbol z' denotes some element such that = <t>{z) (in 7 and 9 a?/': = y) 

— for Q Ax B, 4>^ denotes the inverse (b, a) G (p^ (a, b) G 4> 



Section El introduces mulitalgebras as a possible way of representing relations 
and motivates this choice. Section E| addresses the question of composition of ho- 
momorphisms: [^3 gives a characterization of homomorphisms which are closed 
under composition - in fact, most of the suggested defintions, like most of those 
in table Q do not enjoy this property which we believe is crucial. The proof, 
included here, is a significant improvement of the original proof of this result 
from m- Subsection E21 characterizes the equivalences associated with vari- 
ous compositional homomorphisms. Section 0 summarizes the results on (finite) 
completeness and co-completeness of the obtained categories. 

Except for the statement of the compositionality theorem, the results con- 
cerning the three “inner” categories were published earlier in 112 91301 . This paper 
is the first complete overview with the results for the remaining six categories 
from im . 

Preliminary definitions and notation A relational signature A is a pair 
(iS, TZ) where 5 is a set (of sort symbols) and 7?. is a set of relation symbols 
with given arities (also called type of relation), written [Ri : si x ... x s„] G 72.. 
Similarly, an algebraic signature is a pair (5, R) where T is, & set of function 
symbols with associated sorts and arities, written [/ : si x ... x — >■ s] G iF. A 

relational structure over a signature S = (5,72) is a pair A = (|A|,72"^), where 
|A| is an 5-sorted set called the carrier and is a set of relations, such that 
for each [Ri : si x ... x s„] G 72 there is Rf‘ C |A|sj x ... x |A|s^. An algebra 
over a signature S = (5,.F) is again a pair A = {\A\,T^), where \A\ is an 
5-sorted set called the carrier, and is a set of functions, with a function 
: |A|si X ... X |A|s„ \A\s for each [/ : si x ... x s„ s] G .F. 

In order to reduce the complexity of notation, we shall limit ourselves to 
single sorted structures and algebras (5 has only one element) claiming that the 
results carry over to a multi-sorted case. 

We will study algebras over carriers being power sets. For such a structure 
A with |A| = p(A), the set A will be called the underlying set. Given a function 
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/ : A — > i3, we will often use additive pointwise extension without making it 
explicit in the notation - for any X C A, we write f{X) meaning [J^^x fix)- 
Also, we do not make explicit the distinction between elements and one-element 
sets - if A = p(A) and a G A, we write a G A meaning {a} G A. Homo- 
morphisms of multialgebras map underlying sets to underlying sets, just as the 
homomorphisms of relational structures do. 

Composition is written in diagrammatic order f-,g for g(/(-)). For a binary 
relation/function (p, (p~ denotes its inverse = {{y^x) : {x,y) G </)}. 



2 Multialgebras 

Our interest in relational structures originates from earlier studies of multialge- 
bras ^IhllOllMIlVLllITTEinrT^j which provide means for modeling nondetermin- 
ism in the context of algebraic specification of abstract data types 

Multilagebras can be viewed as relational structures with a spe- 
cific composition of relations of arbitrary arities. According to definition II .21 
relations are viewed as set- valued functions where the last, n-th argument corre- 
sponds to an element of the result set obtained by applying the function to the 
first n — 1 arguments. This view appears in was elaborated in Has, then 
in m and re-emerged in recent years in the algebraic approaches to nondeter- 
minism. It is based on the simple observation that any (set-valued) operation 
f : Ai X ... X An — >■ p(A) determines a relation Rf C Ai x ... x An x A and vice 
versa, via the isomorphism: 

Ai X ... X A„ -)■ p(A) ~ p(Ai X ... X AnX A) (2.1) 

Based on this fact, introduced the concept of algebra of complexes which 

we call muUialgebras - with one proviso: the carrier of a multialgebra is a power 
set but Boolean operations are not part of the multialgebraic signature. This 
seemingly negligible differece turns out to have significant consequences, since 
signatures determine homomorphisms. 

Definition 2.1 Given a signature X = (S,iF), a S -multialgebra M is given by: 

— a carrier \M\ = {|M|s}sg 5 , where for each s G S, \M\s = p(M«) of some 
underlying set M with the obvious embedding M .. ^ to(M A: 

— a function f^ : M x ... x Ms„ v(M A for each / : si x ... x s„ — >■ 
s G J- , with composition defined through additive extension to sets, i.e. 

/^(Ai, ...,Xn) = -,Xn)- 

Multialgebras are “partial” in the sense that operations may return empty set of 
values. By the pointwise extension of operations, they are strict in all arguments. 
Notice also that we allow empty carriers, i.e. p(0) = {0} is a possible carrier of 
a multialgebra. 

One consequence of modeling relations by set-valued functions is that pos- 
sible requirement on the definitions of homomorphisms to be invariant under 
permutation of variables becomes little (if at all) relevant. Such a requirement is 
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not satisfied, for instance, by homomorphisms 5., 6., 7., 9. and 11. in the tabled 
so, using this criterion, one would claim that these definitions are not “appropri- 
ate”. Functions, on the other hand, do have ordered arguments and, above all, 
distinguish one relational argument as the result. Thus, the definitions of homo- 
morphisms are indeed invariant under permutation of the argument variables, 
but there seems to be no reason to require such an invariance under permutation 
of (some) argument variable(s) with the result. 

Although multialgebras introduce some structure not present within the cor- 
responding relational structures, the isomorphism JZH) allows one to convert 
any multialgebra to the corresponding relational structure and vice versa. Mo- 
rover, any definition of a homomorphism of multialgebras can be transfered 
to the relational structures and vice versa, although this may require forget- 
ting/introducing the distinction between the argument and result variables. We 
now turn to multialgebraic homomorphisms but, for convenience, we will mostly 
use relational notation. 

3 Compositionality of Multialgebraic Homomorphisms 

Theorem 13.51 which is the main result of this section, gives an exhaustive char- 
acterization of compositional definitions. We begin by giving a counter-example 
for compositionality of one of the homomorphisms from table Q 

Full homomorphisms were considered in |TTOT] as the homomorphisms be- 
tween relations. In a more special form, they also appear in the study of partial 
algebras |S| • In the context of partial algebras, it is known that these homomor- 
phisms do not compose. But it is not clear if all the authors were aware of this 
fact in the general setting. 

Example 3.1 Let A,B,C be structures with one relation R - a) presents a 
many-sorted counter-example and b) the single-sorted case. 

(f> 'Ip (p Ip 

a) A ^ B 3- G b) A s- B a- C 

\ 





Both (p and ip are full homomorphisms. However, due to non- surjectivity of p, 
the composition p] ip is not full. Although p; ip{0) = 0 and (0, c), resp. (0, 0) S R'^ 
there is no element x € A in the pre-image of c, resp. 0 for which we would have 
{0,x) e i?^. 

3.1 Compositional Homomorphisms 

We assume a fixed relational signature, with R ranging over all relation symbols, 
and consider definitions of homomorphisms p : A ^ B of the form 

A[p] ^ li[p];R'^;n[p] Ix] h[P]; ; r2[p] 



b A 
\ 

0 1 




(3.2) 
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where ^[-]’s and r[_]’s are relational expressions (using only relational composition 
and inverse and parameterized by _), and txi is one of the set-relations {=, C, D}. 



Definition 3.2 A definition is compositional iff for all cj) : A ^ B , : B ^ C , 

we have & A[tp] => i.e.: 

IXI ^2 [<(>]; & 

ixi l 2 [t/j]]R^-,r 2 [fi] 

h[fi;ipy,R^]ri[(j);tp] cx l2[(f;tlj];R^;r2[fi-,'ip] 

The number of syntactic expressions of the kind l[(p] is infinite, however, since 
homomorphisms are functions we have the simple fact: 

Fact 3.3 a) 4>', 4>~ = (j)~ b) 4’~\ c) 

Thus the length of each of the expression l[fi], resp. r[(fi\ (measured by the number 
of occurring fi's or <()“’s) can be limited to 2. 

On the other hand, both sides of a definition from (13.211 must yield relational 
expressions of the same type, i.e., of one of the four types Ax A, Ax B , ..., which 
will be abbreviated as AA^ AB, ... 

For each choice of ex, this leaves us with four possibilities for each type. For 
instance, for AB we have the following four possibilities: 

T ab '■ IXI 4>] R^ 'A~ 4> J-As ■. R^-,4> 4>iR^ 

Eab '■ cx 4>]R^ Wab'-R^'A 4>iR^A~A 

The symbols denoting the respective possibilities are chosen for the following 
reason. Relational composition preserves each of the relations ex, i.e., given a 
particular choice of ix and any relations C,D (of appropriate type), we have: 
i?i X i ?2 => C\Ri X C;i ?2 and i?i x i ?2 Ri\D x R 2 ,D. Starting with 

J-ab and pre-composing (on the “Fast”) both sides of x with 0;^“;(_), we 
obtain Eab] post-composing (on the ‘West”) both sides of x with (_);</>“; ^, 
we obtain Wab ■ Dual compositions lead from there to T ab ■ Thus we have that 
Eab =A Eab , Wab ab and the corresponding lattices are obtained for the 
other three types starting, respectively, with 

Eaa ■ R^ IX </>; R^] <P~ Ebb ■ (t>~\R^\ x Eba ■ 4>~]R^ x R^\ A 

Figure [D shows the four lattices for each type (the choice of x is the same for 
all of them). 

The additional equivalences (indicated with dotted arrows) are easily verified 
using the fact that composition preserves each of x and Fact 13.31 Also all the 
top definitions are equivalent which follows by simple calculation. 

These observations simplify the picture a bit, leading, for each choice of x, 
to the order of 9 possible definitions shown in figure 0 

Furthermore, choosing C for x, the above ordering collapses. 

Proposition 3.4 All definitions (of the form iti.Al } involving C are equivalent. 
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T AB 



T aa 




(.-);■/>- 






4> ',R \4> R 



C-l'.'f' ■,4> 






' ; R [XI R ; (p 



0 :■!>;(-) 



Fig. 1. Lattices for each relation type for each choice of ixi). 




J-SS -i-AB J-BA J-AA 



Fig. 2. Possible definitions (for a given choice of cx]). 



We are thus left with one definition involving C and 18 other definitions obtained 
from two instances (with =, resp. 3 for cxi) of the orderings in figure |3 The 
following, main theorem shows that only the bottom elements of these orderings 
yield compositional definitions. 

Theorem 3.5 A definition is compositional iff it is equivalent to one of the 
following forms: 

1) 2) (j)--R^-(j)>R^ 3) (j)--,R^t>R^](j}- 4) R^xfiR^^cf- 

where Cxi G {=, C, D} and > G {=, 3}. 

Proof: For the “if” part, one easily checks that l)-4) do yield compositional 
definitions. In fact, this part of the theorem holds for any transitive set-relation 
[XI. For instance, for 3) we verify: 

4>~ ; R^ R^ ; 4>~ & ; R^ R^ ; 'ijj~ 

=> ijj~; 4>~;R^ IX iIj~; R^; (j)~ & iIj~; R^; (j)~ x R^; (j)~ 

^ ( 0 ; ' 0 )"; R^', ’>P)~ 

The “only if” part is shown providing counter-examples for the remaining possi- 
bilities. Although there are 10 cases left, they are easily shown by the following 



366 



M. Walicki, A. Hodzic, and S. Meldal 



three counter-examples. In all cases, the given homomorphisms satisfy the 
respective definition with = for c> (hence, also for D), while their composition 
does not satisfy the respective definition with D for c>. Thus we obtain immedi- 
ately counter-examples for both > S {=,2}. 

Vertical arrows represent the relation (R) in respective multialgebras; the dotted 
arrows illustrate the images under the respective homomorphisms: 











4>b 


Ipb 








A- 




~^C 


A- 




~^c 


A- 


^B- 


~^C 




h 






bs 












f" 






. ■ ■■" t 










02 


^ &2 


C2 


02 


b2 


C2 




^2 




t 


t 


'd 


t 


t 


Pa 
< > 






■ A 


Oi 




^ Cl 


Ol 


^1 


Cl 


Ol 


^ bi 


> Cl W 




a) 






b) 






c) 





a) for Wbb ■ We have: (j)^ ; (j>a = R^]4>a',(t>a and 

ijj~; R^;tpa = '*Aa ; V'a- However, for the composition pa = we have 

(c 2 ,ci) G R^;p-;pa but (c 2 ,ci) ^ p~;R'^;pa, i.e., p~;R^;pa 2 R^',Pa,Pa- 

b) for Ebb ■ ^ 05 R^ is quite analogous, cj)'^; R^; (f>b = R®; (pb': 4>b 

and V'fT;R^;V’b = R*^; V’fT; '*/’&> Pb\R^\Ph 2 Pb',Pb;R^ with (C 2 ,ci) as a 
witness to this negation. 

Both these examples can also be used as counter-examples for compositionality 
of T, represented by T bb- For instance, in the first case, we have R^] (pa'jipa = 
R^ ]4’at‘Pa and the corresponding equality holds for ipa and R^ - so 
exactly the same argument yields a counter-example also for this case. 

c) for both W AAjAB and Ejyj^jBA (this is essentially exa.mnle ld. Il bl. Both (pc and 
ipc are obviously Wab'- R^',<pc = (pc]R^',4>c'Ac and R^',ipc = 'fpc, R'^' ',tpc]tpc- 
However, their composition yields: 0 = R^\ pc 2 PcJ R*^; Pc'^ Pc = (ci, ci). 

This gives also counter-example for Eba ■ (pc', R^ > (pc', (pc', R^', (Pc- FI 

This leaves us with 9 basic compositional definitions (more can be obtained by 
their conjunctions (see HU)- Inspecting the table Ql we can see that 1. and 2. 
define actually the same notion, and the only other compositional definitions are 
9., 10. and 11. 

Notice that, although we have used a rather special definition of relational 
composition, all counter-examples involve only binary relations. Thus, even if 
one defined composition of relations differently, as long as it subsumes the stan- 
dard composition of binary relations, the theorem gives the maximal number of 
compositional definitions of homomorphisms. 

On the other hand, one might probably come up with other forms of defining 
homomorphisms that are not covered by our theorem. 0 However, the majority 

^ E.g., using complementation in addition to composition and inverse, or else using 
relations instead of functions as homomorphisms. We have to leave such generaliza- 
tions to future research. 
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(if not all) of commonly used forms do conform to this format. Occasionally, 
some authors consider certain modifications of the definitions from table Q For 
instance, full outdegree and indegree homomorphisms (3,5,6) with the extra sur- 
jectivity requirement do compose. This restriction merely enforces the equality 
(j)~](t) = ids, and leads to a special case of cases 2), 1) and 3) from the Theo- 
rem ^21 respectively. 

3.2 Congruences on Multialgebras 

Congruences of relational and power structures were studied, for instance, in P] 
I2|3|1E|. As observed before, any equivalence gives rise to a (weak) homomor- 
phism. However, the more specific definitions from th eorem 13 . 01 m av lead to more 
specific relations. We consider first equational definitions from the theorem, and 
characterize these kernels which turn out to be not merely equivalences but 
congruences of a sort. To proceed further we need a notion of a quotient: 

Definition 3.6 Given a structure A = (A, and an equivalence ~ C 

Ax A, a quotient A/^ = Q is given hy Q = {[x] : x £ A} and Rf = (j)~; Rf; (f>, 
where [x] = {y £ A \ x ^ y}, and (f> : A^ Q is defined by 4>{x) = [a;]. 

Obviously, ^ is the kernel of (f), i.e., x ^ y iS 4>{x) = 4>{y), and ~ . 

Proposition 3.7 Let ^ be an equivalence on A and Q, be as in def. I,V. tl 



if ^ satisfies then 



2 ) 








1 ) 






R^; (l) = (j);RQ 


3 ) 






fi-;R^ = RQ;fi- 


4 ) 




- = R^ 


rA = (j)- rQ-^~ 



In 1), 3) and 4) the relation ^ is not just an arbitrary equivalence but has a 
flavour of a congruence: 

1) can be stated as: Va, 6 : {3b' ^ b,d' ^ d : R^{d! ,b')) i3b' ^ b : R^{a,b')), 

which gives: Vo, 6, d' : R^{d, 6) A a ~ a' => 36' ^ 5 : R^{d', 6'); 

3) yields a dual condition: Va, 6, 6' : R^{d, 6) A 6' ~ 6 3a' ~ d : , 6'); 

4) is strongest: Va, 6, a', 6' : R^{d, 6) A a' ~ a A 6' ^ 6 ^ R^{d', 6')0 

For any (at least weak) homomorphism we have the converse of tt. /t 
Proposition 3.8 Given a homomorphism : A ^ B , let be the kernel of 





if (j) : A ^ B satisfies 


then ^ 


is an equivalence and 


1 ) 


R^] 4> = (t>]R^ 






3 ) 


(j)-]R^ = 






4 ) 


rA = (j)- rb. 







^ Following a suggestion from an anonymous referee, one might want to compare this 
case to the notion of bisimulation of process. 
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There is no line for condition 2) since (j)~; (j) = obviously implies that ^ 

is an equivalence but, in fact, this follows for any mapping cj). 

This is not the strongest formulation of this fact. For ^ being an equivalence 
it suffices, of course, that (/> is a weak homomorphism. Furthermore, for instance 
1) implies R^](j) = (f)\ R^ ] (j)~ ] cj) which is sufficient to establish the respective 
property of In general, since ^ is induced only from the image of A under (f>, 
restricting the homomorphisms’ definitions on the Reside to this image (i.e. by 
(j)) will yield the same properties of 

Similar results do not follow for the “closed” versions, i.e., for the homo- 
morphisms defined by 3 in place of =. We can uniformly replace = by 3 in 
nronosition 13.81 but then the statements in the right column are trivial for any 
mapping cj). If the target algebra is total then the kernel may retain the flavour of 
congruence. But, in general, “closed” homomorphisms induce only equivalence. 



4 Categories of Multialgebras 

Theorem 13.51 gave three possibilities for ixi. The category substituting for [xi 
the relation C is called “weak”, those with = are “tight”, and those with A 
“closed”. (These names are, to some extent, motivated by the tradition within 
partial algebras and multialgebras.) In lack of better names, we then call the 
categories of kind 1) “inner” (since 4>~,4> occur “inside”, closest to ixi), of kind 
2) “left”, of kind 3) “outer” and of kind 4) “right”. For a given signature S, we 
thus have 9 categories, with homomorphisms (f> : A ^ B given by the respective 
compositional definition: 





inner 


left 


outer 


right 


closed 


MAIg,c(if) : 
R^;cl>2 (j); R^ 


MAIg^c(^) : 

cf)--,R^-(j)A RB 


MAIgQ(-,(Z') : 


MAIg^c(^) : 
R^ D (j); R^;(j)- 


tight 


MAIg,j,(I7) : 
R^- (j) = (j)-,R^ 


I^AIg2^2n(i7) : 

(j}-;R^-,(l) = R^ 


MAIgQjn(i7) : 

= R^;ct>- 


MAIg^^(T') : 
ra = (j)- rB-cJ)- 


weak 






R^\(j)Q (j)] R^ 





Table El summarizes finite (co-)completeness of the respective categories: ‘-I-’ 
means the existence of the respective (co-)limit for arbitrary 17; ’ indicates that 

there exists a counter-example: a signature S and 27-multilagebras for which the 
respective (co-) limit does not exist. 

With a few exceptions, the positive results are obtained by constructions similar 
to (though never the same as) those in the standard category of algebras. The 
proofs for the inner (and weak) categories can be found in [2S|, and for the 
remaining ones in un- 
it has perhaps been a prevailing opinion among mathematicians interested 
in the question that weak homomorphisms of relational structures provide the 
most useful notion. They were certainly the most commonly used ones. The 
above results do, if nothing else, justify and demonstrate this opinion: the cate- 
gory MAIg^y(T') possesses many desirable properties not possessed by the other 
categories. There is, however, an exception to this claim, namely, the category 
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Table 2. Finite limits and co-limits in the categories of multialgebras 





initial obj. 


co-products 


co-equalizers 


terminal obj. 


products 


equalizers 


MAIg;.j,(r) 


+ 


+ 


-f 


+ 


-f 


-f 


MAig,c(i:) 


- 


- 


- 


+ 


- 


- 


MAig,y(r) 


- 


- 


-f 


- 


- 


- 


MAIg^c(^) 


- 


- 


-f 


+ 


- 


- 


MAig^^(r) 


- 


- 


-f 


- 


- 


- 


MAIgoe(r) 


-f 


+ 


- 


+ 


- 


+ 


MAIgo^(r) 


-f 


+ 


-f 


(+) 


? 


+ 


MAIg«c(^) 


+ 


+ 


-f 


+ 


+ 


+ 


MAig«^(r) 


-f 


- 


- 


- 


- 


+ 



MAIg^(;;;(Z’) which, too, is finitely complete and co-complete. A possible ex- 
ception is also the category MAIgQjn(A'). We are almost certain that terminal 
objects, marked with (-I-), exist and can be obtained by an interesting kind of 
term-model construction which has been shown to work in special cases. Also 
products, marked with ‘?’, remain still under investigation^ If the conjecture 
about completeness of MAIgQ 2 n(A') turns out to be true, this might be the most 
interesting of all the investiagted categories. This can be further strengthened by 
observing that (/)“; = R^; </)“ implies R^] ^ C R^ , so that properties of the 

most common, weak homomorphisms are actually implied by outer-tightness. 

5 Conclusions 

We started by considering the relational structures but the suggested definition of 
composition of relations of arbitrary arities turned such structures into algebras 
- namely, multialgebras. These provide a convenient way for algebraic study 
of relational structures, as well as for modeling phenomena like partiality and 
nondeterminism within a unified framework. 

We have given, in theorem 13.61 a general characterization of compositional 
homorphisms of multilagebras. The characterization applies equally to relational 
structures. Thus, from the manifold of alternative proposed definitions of such 
homomorphisms, we identified 9 which allow a categorical approach. We have 
also described the equivalence relations which emerge as kernels of the corre- 
sponding (compositional) homomorphisms and which have various flavours of 
congruences. 

We have then studied in detail the obtained categories with respect to the ex- 
istence of (finite) limits and co-limits. The categories with weak and right-closed 
homomorphisms, MAIgj^j^ (A), MAIg^^(A'), possess all constructions. The former 
has been widely used and is probably the most accepted standard. It remains to 
be seen whether the structural properties of the latter category demonstrated 
here will offer grounds for comparable applications. We conjecture that also the 

Hopefully, the details and proofs of these constructions will be completed by the 

time of the conference. We suggest the interested reader to contact the first author. 
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category MAIgoy(Z') is finitely (co-)complete and may be, in fact, highly inter- 
esting, but its terminal objects and products need further investigation. 

The remaining categories have much poorer structural properties. Thus, ac- 
cepting compositionality and the categorical properties we have studied as the 
criteria for the evaluation of the homomorphisms, our results leave quite limited 
choice. These criteria may, of course, be debatable and we by no means claim 
their universal validity. However, even if one does not want to base one’s choice 
exclusively on these criteria, the results of this paper, in particular, the charac- 
terization of compositional definitions, can provide a useful tool preventing one 
from looking for new, idiosyncratic notions serving only very peculiar purposes. 

We have studied only the categories of all multialgebras over a given sig- 
nature. The really interesting question might be whether the properties of (co- 
)completeness can be lifted to axiomatic classes. The problem with answering 
such a question is that one would first have to decide on the logical language in 
which one writes the axioms. This question is far from settled and faces different 
proposals from different authors working on multialgebras. Still, even though we 
have not considered axiomatic classes, our results can serve, at least, the negative 
purpose. We have studied axiomatic classes over empty sets of axioms - hence, 
at least the negative results, the demonstrated non-existence of some construc- 
tions, will remain valid when lifted to axiomatic classes specified in arbitrary 
languages. 
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Abstract. An autonomous automaton is a finite automaton with output 
in which the input alphabet has cardinality one when special reduced. 

We define the transition from automata to semigroups via a represen- 
tation successful if given two incomparable automata (neither simulate 
the other), the semigroups representing the automata are distinct. We 
show that representation by the transition semigroup is not successful. 

We then consider a representation of automata by semigroups of partial 
transformations. We show that in general transition from automata to 
semigroups by this representation is not successful either. In fact, the 
only successful transition presented is the transiton to this semigroup of 
partial transformations together with its generating set, and in this case 
success occurs only with autonomous automata. 

1 Introduction 

Let V = (Q,A,B,Oj*) be a finite automaton with output, where Q,A,B are 
finite, nonempty sets; Q x A — ^ Q is a function and Q x A i? is a surjecive 
function. The sets Q, A, B are called the set of states, the input alphabet and the 
output alphabet, respectively. The mapping o is called the transition function of 
the automaton V and * is called the output function of V. 

Let Q = {qi,q2,...,qk}, A = {oi, 02, . . . , a^}, B = {61, 62, . . . , 6„}. In P 
Balode and Buis considered the following representation of V by partial trans- 
formations on Q. For (ai,bj) G A x B define 



Let P{V) denote the semigroup generated by y(A x B). That is, P{V) = (Imy). 
The semigroup P{V) is called an autonomaton semigroup. 

Simulation was first discussed by Hartmanis 0 forty years ago. This concept 
describes the possibility on abstract level in which one machine could be replaced 




where for every s 




qsOUi, ifqs*a., = bj 
undefined, otherwise 



5 U-2 — Uj , 

otherwise. 
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by other in applications. If we like to treet the automata by semigroups as it 
done till now and develop the theory not only as selfsufficient discipline the 
connections between simulation and semigroups should be considered from every 
point of view too. Thus we say that a transition from automata to semigroups 
through some representation is successful if it adequatly characterizes simulation. 
In this note we demonstrate that the transition from autonomous automata to 
automaton semigroups can only be deemed to be successful if we consider not 
only the automaton semigroup but its generating set of partial transformations 
as well. That is, the representation which maps an automaton V with output 
to the pair (Imy, P{V)) yields a successful transition in the case of autonomous 
automata but the transition to the representation of F by P(y) is not successful 
even in the autonomous case. 



2 Successful Transitions 

If C and 'C are alphabets, any mapping C — ^ 'C can be extended in the usual 
way to a mapping, also denoted h, from C* to 'C* . Thus ii V = {Q, A, B, o, *) 
we may extend the mappings o and * to Q x A* by defining 

qo e = q, qo (ux) = (q o u) o x 

q * e = e, q * (ux) = (q * u)((q o u) * x) , 

for all q G Q, (u,x) G A* x A, and where e is the empty word. Henceforth, we 
shall omit parantheses if there is no danger of confusion. So, for example, we will 
write qo u* X instead of (qou) * x. 

Definition 1 . Let V = (Q,A,B), 'V = {'Q,A,B) be automata. We say that V 
simulates 'V by 

'Q^Q, A-^ A, B^'B 

if the diagram 

'QxA*^ 'B* 
hf t ^2 t ^3 
Q X A* ^ B* 

commutes. That is, if 

' q * ' u = h^(hiC q) * h2C u)) for all ('q,'u) G 'Q x A* . 

We write V > 'V(hi,h2,h^) if V simulates 'V by h\,h2,h^. We say V 
simulates 'V if there exist maps such that V > 'V(hi,h2, hz). We write V >'V 
if V simulates 'V . 

The two automata V and 'V are incomparable iiVi^'V and 'V ^ V. If, on 
the other hand, V >'V and 'V >V then we say that V mutually simulates 'V 
and we write V cxi 'V . 

Let V = (Q,A,B) and 'V = {'Q,A,'B), and let q G Q,'q G 'Q. The states q 
and 'q are called distinguishable if there exists a, u G A* such that q * u 'q * u. 
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Otherwise, these states are said to be indistinguishable. If every state in Q is 
distinguishable from every other state in Q then the automaton is called a reduced 
automaton. 

The automaton V distinguishes the letters 01,02 if there exists a state q G Q 
and a word u G A* such that q * oiu ^ q * 02U. A reduced automaton that 
distingushes every pair of letters in its output alphabet is called a special reduced 
automaton. By |5j, for every automaton V there exists up to isomorphism a 
unique special reduced automaton 'V such that V ixi 'V . Because of this we may 
restrict our attention to special reduced automata. 

We say that a representation of finite automata with output is successful if 
incomparable special reduced automata have nonisomorphic representations. 

The focus of our investigations will be so-called autonomous automata which 
provide the counterexamples that we need to demonstrate that our representa- 
tion of finite automata by automata semigroups cannot be deemed successful. 

Definition 2. An automaton V = {Q, A, B) is called autonomous if 

\/q G QVa G A\/x G A (q o a = q o X A q * a = q * x). 

Thus the input alphabet A of a special reduced atonomous automaton has pre- 
cisely one letter. 

2.1 Representation by T(Q) 

Let T{Q) denote the semigroup of all transformations on the set Q. For each 
Ui G A define a{ai) G T{Q) by 

Oi{ai) = "^^l^ere Vs (g' = g, o at). 

V 9 i 92 ■ • • 

We say (Imo) is the automaton V transition semigroup. 

Proposition 1. There exist special reduced incomparable autonomous automata 
with the same transition semigroup. 

Consequently, the transition semigroup representation of finite automata 
with output cannot be regarded as successful. 

2.2 Representation by Similar Partitions 

Define DomF = { Domtr | a G Imy }, where Domcr is the domain of the partial 
transformation tr. The following result is easy to establish. 

Proposition 2. IfV is autonomous then DomF is a partition of Q. 

Two equivalence relations, S and 'S on Q and 'Q, respectively, are said to be 

f 

similar if there is a bijection Q — >• 'Q such that S{q, z) <t 4 - 'S{f{q),f{z)), for all 
q, z G Q. Likewise, two partitions are similar if they induce similar equivalence 
relations. 

The following proposition is easily verified. 
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Proposition 3. Let V and 'V he isomorphic special reduced autonomous au- 
tomata. If 7 and '7 are the representations of V and 'V , respectively, by partial 
transformations on Q, then DomR and Dom'R are similar. 

However, we also have the following. 

Proposition 4. There exist incomparable special reduced autonomous automata 
V and 'V with DomR = Dom'R. 

3 Similar Sets of Partial Transformations 

Definition 3. Sets T C PT{Q) and 'T C PT{'Q) are called similar if there 
exist bijections 

g 'Q, T -^'T 

such that a diagram 

Q ^ Q 

fi if 

'Q 'Q 

is commutative for all a € T. 

So, this means that 

(i) qa is defined iff qftp{a) is defined (we write qfif{a) instead of {'tp{a)){f{q))); 

(ii) if qa and qfip^a) is defined then qaf = qftp{a). 

Theorem 1. Let Ax B PT{Q) and 'A x 'B PT{'Q) he representations 
of special reduced autonomous automata V and 'V respectively. Then V and 'V 
are isomorphic iff Im7 and Im'7 are similar. 

Theorem 2. Let V = (Q,A,B) and 'V = {'Q,A,'B) are special reduced au- 
tonomous automata with one and the same automaton semigroup. If for every 
a G Im7 with |Dom(r| = 1 there is a 'a € Im'7 such that a = ' a then V and 'V 
are isomorphic. 

We conclude by stating that the condition on the members of Im7 of domain 
size one is crucial. 

Proposition 5. There exist incomparable special reduced autonomous automata 
with one and the same automaton semigroup. 
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Abstract. Since a quantum computational system is a generalization 
of a classical computational system, its computational power should be 
greater or equal than that of the classical system. In spite of that the 
computational power of 1-way quantum finite automata has been shown 
to be smaller than that of their classical counterpart. I argue that this 
paradox lies on the ground that the currently accepted dehnition of quan- 
tum automaton neglects the concept of quantum reversibility. In this ar- 
ticle I review the role that reversibility plays into quantum computing 
and I propose a new model of 1-way quantum finite automata whose 
computational power is at least equal to that of classical automata. 



1 The Current Model of 1-Way Quantum Finite 
Automaton 

l-way quantum finite automata has been defined by Kondacs and Watrous 0. 
A quantum finite automaton Q is a 6-tuple 

Q = {H, |so), HacC) P&cc, A, Uj^J 

I 1 2 

where H is the internal states Hilbert space, |so) S H with ||so)| = 1 is the 
initial state vector, ilacc C iJ is the accepting space. Pace is an operator that 
projects into Pacci A is the input alphabet (composed of classical symbols). For 
each symbol a G A there is a unitary transition matrix Ua acting on H: U is 
the set of these matrices. 

Using the shorthand Uyj = the language accepted by Q is 

the function f^{w) = |PaccPu)|so)P from words w = W\W 2 ■ ■ - Wk G A* to 
probabilities in [0, 1]. 

The input alphabet consists of classical elements, whereas the internal states 
are represented by quantum symbols (by vectors of a Hilbert space): this au- 
tomaton is not fully quantum. Moreover, it is neither reversible. In fact from the 

* This work is for the most part based on my degree thesis. I particularly tank my su- 
pervisor, Prof. Giuseppe Trautteur, and my co-supervisor. Prof. Vincenzo R. Marig- 
liano, both at the Dipartimento di Scienze Fisiche dell’Universita di Napoli “Federico 
IF. 
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final state f/u,|so) one cannot retrace the computation as w is unknown. To re- 
trace the computation one will need some information that is not encoded in the 
final state, i.e., the operator U^j- As to the computational power of this model 
of quantum automaton, first A. Kondacs and J. Watrous 0 have showed that 
there is a regular language it cannot accept. Then, A. Brodosky and N. Pip- 
penger P| have demonstrated that it cannot recognize a whole class of regular 
languages. Its computational power is then smaller than that of 1-way classical 
finite automata. 

2 Quantum Reversibility 

To perform a quantum computation we apply to the quantum system that cod- 
ifies the information we want to process some unitary operators. Quantum com- 
puting is a reversible process because unitary operators are invertible. 

It is well known that every classical irreversible computation can be trans- 
formed into a classical reversible one introducing source and garbage, and that 
the garbage can be recycled so that it grows linearly with the input size |llf| . We 
don’t know if it is possible to achieve this result in the quantum framework, but 
certainly it cannot be achieved with the same trick used in the classical case. In 
fact to recycle the garbage using the classical trick a copy operation is needed, 
but in the quantum framework a copy operation is not always allowed because of 
the no-cloning theorem. So, in general, a quantum computation produces some 
garbage that cannot be recycled. 

In quantum computing we are not interested in computing backward, and 
typically we cannot compute backward because we lost reversibility performing 
the final measurement. So, is there any good reason to store the garbage! Clas- 
sically, if we compute with a reversible system but we don’t want to compute 
backward, we can erase the garbage and use it as new source, improving the 
consumption of space. Quantum mechanically this is not allowed. In the general 
case, in fact, the garbage is entangled with the computational system and we 
cannot erase it without messing the computation. 

To summarize, the unitary evolution imposed by quantum mechanics im- 
plies that quantum transition functions and quantum gates must be reversible. 
Entanglement forbids the construction of an irreversible quantum system for 
information processing based on these reversible building blocks. 

3 A New Model of 1-Way Quantum Finite Automaton 

To construct a quantum automaton that is fully quantum and strictly reversible 
we can generalize to the quantum case a classical model of reversible automa- 
ton. Does a model of 1-way classical reversible finite automaton exist? It turns 
out that if we apply Bennett procedure to transform an irreversible Turing 
machine that moves only to the right and that doesn’t write on its tape (i.e. a 
1-way classical finite automaton) into a reversible one, we end up with a 2-tape 
reversible Turing machine that moves only to the right and that writes on its 
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Fig. 1. Quantum finite automaton. It is supposed that the internal state and the states 
of the cells factorize. 



tape^. I call this system “1-way reversible finite automaton”, and here below I 
generalize it to the quantum case. 

A quantum finite automaton Q is a 9-tuple: 

Q ~ (^in> l®o)> Hg,cc, -Pacc) 5'G, I, SG, U) 

where is the Hilbert space of internal states, |so) S with ||so)|^ = 1 
is the initial internal state, Hacc Q is the accepting space. Pace is an 

operator that projects into Paccj I and SG are two Hilbert spaces, I and SG 
are two quantum tapes whose cells are quantum systems described respectively 
by vectors of I and SG. U is the (time independent) unitary evolution operator, 
U : Pjjj ®I®SG^ SG. 

The input alphabet is composed by an orthonormal basis of I. The dimension 
of I depends on the automaton we are constructing. The dimension of SG is 
related to the way the unitary evolution operator U acts. 

The state of the cell of I is described by a vector belonging to the space 
Ik = I, while the state of the ‘cell’ of SG is described by a vector belonging 
to SGk = SG. If the input string is |rc) = |wi)|w 2 ) ■ • ■ |w„), with \wk) G Ik, then 
the tape SG contains | 0 i)| 02 ) . . . |0„), with |0fc) S SGk. This situation recurs 
in all models of reversible computation: the value of the source must be set 
appropriately at the beginning of the computation. 

In order to understand how this automaton works it turns useful to define: 

C/i = Ii (g) I 2 ® . . . 0 Ii_i (g) li+i (g) . . . (g) I„ (g) , 

where Ifc is the identity of Ik 0 SGk and Ui the unitary operator U when applied 
to (g) Ji (g) S'Gi. Moreover we define 



|0)" = |0i) ® IO 2 ) 0 . . . ® |0„) , 

^ It is not useful to add a 3rd tape as Bennett [Q did because, as pointed out in the 
previous section, in the quantum framework we cannot use it to recycle the garbage. 
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with |0fe) G SGk- Using this notation |w) = |tui)|w 2 ) ■ • ■ \wn) is accepted with 
probability 

-P(k)) = |PaccU„C/„_i...C/i|so)®|tc)®|0)"|^ . 

It is worth noting that, although when you apply Uk you formally act on the 
space ® SGi) ® actually: 1. A local operation is performed, and 

2. The states of A and SGi with i> k do not modify. Therefore, when Uk with 
k < i is applied, you do not have to dispose the cells A or SGi. This further 
implies that at the beginning of the computation the tapes length has not to be 
specified and that more cells can be added as the computation moves on. This 
is exactly what happens in the classical case. 

4 Computational Power 

From we can deduce that every 1-way classical finite automaton can be effec- 
tively transformed into a 1-way classical reversible finite automaton. Our model 
of quantum automaton can clearly simulate this classical reversible automaton, 
and then it has at least the same computational power of classical automata. 

5 Conclusion and Open Questions 

The main result of this paper is to clarify the relationship between quantum 
computing and reversible computing. It is argue that quantum computing is 
reversible because of unitary evolution and entanglement. 

Stressing the role of reversibility, a new model of 1—way quantum automaton, 
at least as powerful as 1-way classical finite automata, is proposed. This is a 
relevant result since the computational power of the currently accepted model 
of 1-way quantum finite automata is smaller than that of 1-way classical finite 
automata. 

This paper rises the question of space consumption of quantum computing 
and that of the computational power of the proposed model. 
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Abstract. 1.5QTM is a sort of QTM (Quantum Turing Machine) where 
the head cannot move left (it can stay where it is and move right). For 
computations is used other - work tape. In this paper will be studied pos- 
sibilities to economize work tape space more than the same deterministic 
Turing Machine can do (for some of the languages). As an example lan- 
guage (0*T|i > 0) is chosen, and is proved that this language could be 
recognized by deterministic Turing machine using log(i) cells on work 
tape , and 1.5QTM can recognize it using constant cells quantity. 



1 Introduction 

Quantum Turing Machine (QTM) is the quantum counterpart of the ordinary 
Turing machine. As an ordinary Deterministic Turing Machine (DTM) it consists 
of head and tape with transition function S:QxSxExQx {t— , j,, — >■} , where 
Q is states of QTM, S-tape alphabet and {^,4_, — >■} represents one of the three 
available head movement directions. Evolutions of QTM required to be unitary. 
For arbitrary Turing Machine to be QTM there exists well-formedness conditions 
(when they are met, machine is QTM). 

These well-formedness conditions for QTM’s could be found in [G] 

1.5 way Quantum Turing Machine (1.5QTM) [AI 99] is a kind of Turing ma- 
chine with two tapes: one contains input word and the other is for work purposes. 
The input tape’s head cannot move left - only right and remains stationary, how- 
ever work tape’s head can move both directions. All evolutions of this kind of 
Quantum Turing Machine must, like in ordinary QTM be unitary. 

Quantum Fourier Transform (QFT) is used in algorithm to make final accept 
or reject decision. QFT is a quantum version of ordinary Fourier transform. This 
transform maps function from time domain to frequency domain. The function 
obtained after Fourier transform of time function has nonzero values only at a 
multiples of 1/r where r is a period of original function. 

As any other quantum operator Quantum Fourier Transform has its unitary 
matrix. When QFT matrix is applied to quantum state \a) its maps it as follows: 

* Research supported by contract IST-1999-11234 (QAIP) from the European Com- 
mision, and grant no. 01.0354 from the Latvian Council of Science. This paper was 
written with financial support from Dati ltd. 
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For example, when applied to |0) QFT maps it to the all other states with 
equal amplitude, namely 1/q so the impact of QFT is similar to Hadamart 
transform. In the case when all states initially has equal amplitudes the impact 
of QFT is just opposite. Single state gains amplitude 1 and all the other 0. 

To further explain use of QFT in language recognition another QTM prop- 
erty should be discussed. That is computation paths interference. When states 
of QTM are measured, and two computational paths interfere then output prob- 
ability is computed by using formula: 

Pa = {^l,a + ^2,a)^ ( 1 - 3 ) 

Where Pa is the probability of the state |a) to be measured and ^i,a and 
^ 2 ,a are amplitudes of two interfering computational paths. When two paths do 
not interfere then the probability to measure state |a) computed by formula: 

Pa = (^l,a)^ + (^2,a)^ (1-4) 

Interference between computational paths take place only when classical in- 
formation both paths operate with is equal. That means that both paths has 
their head position equal as well as symbol on the tape and the moment of time. 
If these conditions are violated then interference between computational paths 
is lost and probability of state |a) changes. 

Further in this paper will be discussed space-efficient language recognition 
problem, using deterministic 1.5way Turing machine and then 1.5way Quantum 
Turing Machine. 



2 Example 

To prove that 1.5QTM can economize space on work tape better than deter- 
ministic TM can, language L is used, containing words in the form (0*1® |t > 0). 
This language can be recognized by probabilistic 2- way finite automata [F 81]. 
problem is then solved using deterministic TM and then 1.5QTM. 
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3 Using DTM 

Using deterministic TM this problem could be solved relatively easy, but it 
requires at least log(i) cells on work tape [LSH 65] to write quantity of zeros on 
input tape. There can’t be more efficient solution because of limitation on input 
tape head movement (cannot move left). So when comparing ones and zeros 
count TM should know exactly how many zeros there was, and most efficient 
way to do that is to write that number in binary (or with other base) code. If 
there exists the real necessity to move head not only to the right, but also to the 
left whole word can be copied to work tape and then all computation performed 
exactly like 2DTM works. However this technique requires linear space on the 
work tape with the respect to the input word length. 



4 Using 1.5QTM 

Tape space could be economized if we consider using 1.5QTM instead of DTM. 
There exists method, how to solve this problem, using only constant count of 
cells on work tape. 1.5QTM, solving this problem, accepting words x G L with 
probability 1 and rejecting those x € L with probability at least 1 — e, where e 
could be arbitrary small. Algorithm of this QTM is similar to that described in 
[G] p.160. 

At starting symbol QTM splits into superposition of working states, and 
starts to read from input tape. This moment QTM branches its computation 
in n parallel paths, and each moves head according to this rule: for each new 0 
QTM reads from input tape head stays where it is for i cycles and then moves 
1 step to the right. If QTM reads I from input tape its head remains stationary 
for n — i + 1 cycles and then moves 1 step right. This warrants that no two 
computation paths can reach right endmarker simultaneously if word is not in 
language L. 

Because j-th path needs u*j + v*(n — j + 1) cycles to reach right endmarker, 
and j^th ^ j path u* jl + v * (n — jl + 1), and for this two numbers to be equal 
u must be equal to v. 

On its way from left to right endmarker all paths checking format of the 
word. If word is not in the form Oilj then all the paths sooner or later will come 
to rejecting state, and the word will be rejected. If the format of the word is not 
violated then Quantum Fourier Transformation is applied to the superposition 
of QTM states when each computation path reaches right endmarker. If they 
reached it simultaneously then QFT results in the single (accepting) state, oth- 
erwise new superposition of states is formed with probability of accepting the 
word no more than 1 /n, where n is paths quantity. This result is formed because 
of computation paths interference. If they reached endmarker simultaneously 
then they all have equal classical information and so interfere. Otherwise there 
is no interference between them and probability to accept is at most 1 /n. 

To control format of the input word work tape is used. For example, when 
path for the first time encounters change from zeros to ones it writes on the work 
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tape ” 1” symbol. When next zero is read from input tape the computational path 
jumps to the rejecting state. 

When each path reaches right endmarker Quantum Fourier Transformation 
(QFT) is applied to superposition of states of QTM. If all paths reached end- 
marker simultaneously then result of QFT is single accepting state with proba- 
bility 1, else accepting state’s probability is at most 1/n. 

%^l9) = ^E”=ik,.o) 

D\rj^k) =4-i n>/c>0, l<j<n 
=-^4- 1 < j < n 
D\qjr) =-)>4. 1 < j < n 
D\qa) =—>- 4 . 



VoAVjfi) = kij) 

VbyllQ.fc) = ki,fc-l) 

Vbylka) = \qa) 
bo/ll^jr) = \qjr) 

ViA\rjfi) = |rj>-j+i)(l) 

^lyllQ.fc) = ki,fc-i) 
^lylka) = \qa) 

VlA\qjr) = \qjr) 

V$A\rjfi) = \qjr) 
y$A\rj,k) = \qj,k-i) 
V$A\qa) = \qa) 

y$A\qjr) = lQ,n-i-|-l) 



Vo.ilrj.o) = \qjr) 

Vo,i\rj,k) = \qj,k-i) 
Vo,l\qa) = \qa) 

Vo,l \qjr) = \qjr) 

ho,lki,o) = ki.n-i-l-l) 
Vo.ilQ.fc) = kj.fc-i) 
Vo,l\qa) = \qa) 

Vo,l \qjr) = \qjr) 
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v$A\'Tj,k) = ki.fc-i) 

^$,ll9o) = ki,n-i-|-l) 

^$,ll9j>) = \qjr) 
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(4.5) 



V#A - QTM initialization matrix, splits initial state of the QTM into super- 
position of working states 

VoA - Zeroes reading matrix, reads zeroes from input tape 

Voi - Matrix changes state of computational path to rejecting (this matrix will 
work if format of the input word is violated and word like OM-^ 0^ is encountered. 

Via - Matrix for the first ” 1” read from input tape, it also writes flag to work 
tape 

Vii - Matrix that read ”l”-es from input tape. 
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Abstract. A new aggregation algorithm for computing the stationary 
distribution of a large Markov chain is proposed. This algorithm is at- 
tractive when the state space of Markov chain is large enough so that the 
direct and iterative methods are inefficient. It is based on grouping the 
states of a Markov chain in such a way that the probability of changing 
the state inside the group is of greater order of magnitude than interac- 
tions between groups. The correctness of the combinatorial aggregation 
is justified by the method of forest expansions developed recently in m- 
In contrast to existing methods our approach is based on combinatorial 
and graph-theoretic framework and can be seen as an algorithmization of 
famous Markov Chain Tree Theorem. The general method is illustrated 
by an example of computing the stationary distribution. We establish 
also some preliminary results on the complexity of our algorithm. Nu- 
merical experiments on several benchmark examples show the potential 
applicability of the algorithm in real life problems. 



1 Introduction 

Our attention is restricted to finite discrete-time Markov chains (although the 
presented approach is valid also for continuous time) . Such a Markov chain over 
a state space S is usually represented by a transition probability matrix P 
of order n, where n is the number of states in S. For a transition probability 
matrix P, any vector tt satisfying tt^ = '?r^P, SieS = 1 i® called a 

stationary probability distribution cf. P). The most elegant way to calculate 
it is to find the analytical formulas for the solution of the system. However, this 
is usually impossible and the only way is to solve the problem numerically [Z|. 
Problems arise from the computational point of view because of the large number 
of states which system may occupy. It is not uncommon for thousands of states 
to be generated even for simple applications. On the other hand these Markov 
chains are often sparse and possess specific structure. 

* This work was partially supported by the KBN grant 8 TllC 039 15 
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Related research. A lot of research has been done concerning the numerical 
solutions of some linear equations that occur when one studies Markov chains 
(see for example EO). Almost all methods for solving a system of linear equa- 
tions are adapted into this context. In this paper we focus on nearly completely 
decomposable Markov chains (see lllTH l. Such chains often arise in queueing net- 
work analysis, large scale economic modeling and computer systems performance 
evaluation. The state space of these chains can be naturally divided into groups of 
states such that transitions between states belonging to different groups are sig- 
nificantly less likely than transitions between states within the same group. For 
solving nearly uncoupled Markov chains, a family of methods has been proposed. 
They are jointly classified as iterative aggregation/disaggregation P| methods, 
and based on a decompositional approach. Our combinatorial aggregation ap- 
proach can be seen as a generalization of existing aggregation algorithms. There 
are several important advantages over previous methods. The presented algo- 
rithm uses combinatorial properties of directed forests in the underlying graph 
of a Markov chain. Combinatorial and graph-theoretic approach simplifies the 
description of algorithm and proof of correctness which relies on certain facts 
about forest expansions of solutions of linear equation systems jS| . 



Directed forests method. A real square matrix A of size s induces a graph 
G{A) with states {1,2,..., s} and edges between all pairs {i,j) with Oij yf 0. In 
G{A) we define the (multiplicative) weight of a forest / = (S,Ef) and the 
weight of a set T of forests by: 

w{f) = n 

It was observed that many facts are valid simultaneously for both discrete and 
continuous time Markov chains. To deal with them at the same time we use, 
following 1^, a laplacian matrix, i.e. matrix L = {hj)l hj G R satisfying 
hi = — '^j- j^i hj for i = 1, . . . , s. Denote by I the identity matrix of size s. It 
is easy to verify that matrix L = I — P is a laplacian matrix. For U,W Q S 
denote by A{U\W) the submatrix resulting from deletion of rows and columns 
indexed by U and W respectively. We express the solution of a system of linear 
equations as a rational function of directed forest weights (called the forest 
expansion, cf. m)- For stationary distribution this is formulated in the well 
known theorem: 



Theorem 1 (Markov chain tree theorem). If the underlying graph of a 
Markov chain has exactly one absorbing strong component, then the stationary 
distribution is given by: 






for i = 1, . . . s. 



For given functions A,B ■. K. — >■ K, the notation A{e) ~ B{e) means that: 
lime_,.o = 1. A family (L(e) = {hj{£))pj^\, £ S (0,£i)} of laplacian matri- 
ces of size s X s is a powerly perturbed Markov chain, if there exist matrices 
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A = (Sij)ij^s, and D = (dij)ij^s, Sij > 0 and dij € M, for i,j € S, such that 
the asymptotic behavior of laplacians L(e) is determined by A and D as follows: 

^ Sij£ . ( 1 - 1 ) 

We also use the concept of powerly perturbed nonnegative vector which is 
defined analogously. Consider the following graph induced by a matrix D (we 
take into account asymptotically nonzero entries): G*(D) = £ S x S : 

Sij yf 0}). For an arbitrary forest / and a nonempty set T of forests in G*(D) 
we study parameters d(/) defined as the sum of dij over all edges (i, j) of forest 
/ and d{T) being the minimal value of d{f) for all forests f £ T . Analogously 
5(/) denotes the product of all Sij and 6{T) the sum of 6{f) over all f £ 
having minimal d{f). We describe the asymptotic behavior of solutions of 
system L^(i?|i?)x = b, related to a powerly perturbed Markov chain, in terms 
of directed forests expansions. It turns out that a solution of a system of linear 
equations, for a perturbed chain, can be treated as a perturbed vector. 

Theorem 2 ([6j). Let matrices A and D be such that (EH) above holds, for 
a powerly perturbed Markov chain {L(£),£ < £i}; let R Q S, where S is a set 
of states. Moreover let vector b = (C,z) be powerly perturbed vector. Suppose 
that there exist a forest with the root R in G*(D). Then the solution x(£) = 
{xi{e))i^S\R of the system L^(i?|i?)(£)x(£) = b(£) satisfies, for i £ S \ R, the 
relation Xi{e) ^ where the coefficients rji, hi are some constants, i = 

2 Combinatorial Aggregation 

Before describing the algorithm for computing asymptotic coefficients h and 
rj, we explain how it can be used to obtain the approximation of stationary 
distribution vector. The algorithm takes as an input laplacian L = (lij) defining 
Markov chain and parameter £ and consists of three steps: 1. construct matrices 
A and D such that: —lij = 6ij£‘^*T where e < Sij < 1; 2. run Algorithm [D to 
compute vectors r} and h; 3. set 7Ti(£) := rjiS^L 

Fast computation of forest expansions. The algorithm reduces the size of 
state-space of a Markov chain by lumping together closely related states. This 
process is repeated in the consecutive phases of aggregation; during each phase 
graphs induced by matrices D and A are considered. The algorithm groups 
states in each closed class of the graph and solves the system of linear equations 
restricted to this class. Smaller size, hence tractable, systems of equations can be 
solved by a direct method. The solutions of these systems are used to upgrade 
the asymptotic coefficients computed for each state of the original Markov chain. 
Before passing to a next phase, an aggregation procedure is performed, lumping 
all states in each closed class into a new, aggregated state. 

The task of computing exponents hi is of quite different nature than the task 
of computing the coefficients rji . While the former can be performed using purely 
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Algorithm 1 Calculate asymptotic coefficient rj and h. 

1: construct G° = 

2: fc ;= 0 
3: repeat 
4: k := k + 1 

5: find partition of into closed classes 

6: construct 

7: for each closed class in say do 

8: construct laplacian Lk 

9: compute stationary distribution i. e., solve the system L^x = b 

10: compute m{I^) 

11: for each aggregated state in do 

12: compute and V*") 

13: for each state i aggregated into state do 

14: upgrade rji = and hi = h{i\l'^) 

15: end for 

16: end for 

17: for all neighbors of class 7*^ do 

18: determine shortest edges 

19: end for 

20: end for 

21: construct new set of aggregated edges 

22 : G'^ := {S^ , E'^) 

23: until G^ has only one closed class 



combinatorial methods (hence precisely), the latter uses a procedure of solving 
a system of linear equations, exposed to numerical errors. 
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Abstract. We present a useful primitive, the hidden number problem, 
which can be exploited to prove that every bit is a hard core of spe- 
cific cryptographic functions. Applications are RSA, ElGamal, Rabin 
and others. We give an efficient construction of a hard core predicate of 
any one-way function providing an alternative to the famous Goldreich- 
Levin Bit [2|. Furthermore, a conjectured connection between universal 
hash functions and hard core predicates is disproven. 



1 Introduction 

Due to space restraints we assume that the reader is familiar with basic notations 
such as one-way functions and hard core predicates. For a detailed and more 
formal discussion see the full version of this paper A survey of hard core 
predicates can be found in |^. 

2 Main Results and Motivation 

In ^ it was shown that every single bit is a hard core of the RSA function. 
The same proof techniques were applied to show that all (but the first least 
significant) bits of the discrete exponentiation function are hard. It was already 
noted in ^ that the multiplicative structure of RSA was crucial to apply these 
techniques. For instance, by the equation RSA(ccc) = RSA(c) • RSA(a:) one can 
compute RSA(ca:), given RSA(a;) and c. In this paper we present a useful prim- 
itive, the hidden number problem, and study under which circumstances it is 
solvable. As a corollary to our result we conclude the bitsecurity of many known 
cryptographic functions such as RSA, Rabin and FlGamal. 

Although the original proofs of bitsecurity of the RSA 
and the discrete exponentiation (EXP) function are sim- 
ilar there is no straightforward way to conclude the result 
from each other, i.e. to deduce the bitsecurity of EXP di- 
rectly from the bitsecurity of RSA. The figure on the left 
illustrates this connection. The hidden number problem 
was introduced by Boneh and Venkatesan PJ and used 
to show the security of the collection of ^/[ogn unbiased 
most significant bits of the Difhe-Hellman function. 
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Main Results. Under some reasonable combinatorial assumptions, every bit is 
a hard core of the following functions: 

1. The RSA encryption function. 

2. The ElGamal encryption function. 

3. A Modified Difhe-Hellman function. 

4. The Discrete Exponentiation function (all but the first least significant bits). 

5. The Rabin encryption function. 

6. Paillier’s Encryption function. 

Note that 1, 4 and 5 are known results whereas 2, 3 and 6 are new contributions 
of this paper. Due to space restrictions we only consider the RSA and ElGamal 
case here. The other proofs exploit the multiplicative properties of the functions 
in a straight forward way. See j5]. 

As a second corollary we give as an alternative to the famous Goldreich-Levin Bit 
0 a new and very simple construction of hard core predicates for any one-way 
function. Naslund [7| showed that the predicate b(x,r,s) := biti(rx-|- s mod p) 
is a hard core of any one-way function. We improve this by showing that the 
addition “-|-s” is not necessary: 

Theorem 1. Let f be an arbitrary length-regular one-way funetion and let g be 
defined by g{x,r) := (/(x),r), where n = \x\, p is an n-bit prime and r € Z*. 
Let bi{x,r) := biti(rx mod p). Then for every i = fin) G {0, ...,n— 1} the 
predicate bi is a hard core of the function g. 

3 The Hidden Number Problem 

Definition 1 (Hidden Number Problem HNP). Let L C IN be an infinite 
set of integers and P a polynomial. For all N G / and all hidden numbers 
X G let {On,x)n,x be a family of hidden number oracles for the i-th bit with 

Pr[ON,x{c) = hiti{cx mod N)] > 1/2 -|- 1/P{n), 

where the probability is taken over all c G uniformly distributed and over 

all coin tosses of the oracle, n is the bitlength of N. 

We say that HNP is efficiently solvable for bit i if there is a polynomial Q and 
an oracle pptm pW’n.x) allowed to make queries to such that that for every 
N G I and x G outputs x with probability at least 1/Q{n). 

Theorem 2 (Main theorem). 

1. If I is the set of all odd primes, then HNP is efficiently solvable for all bits. 

2. If I is the set of all odd integers N , then HNP is efficiently solvable for all 
bits or one gets a non-trivial factor of the integer N . 

3. If for a fixed integer k, I is the set of all integers of the form N = 2^ ■ q, 
q odd, then HNP is efficiently solvable for the bits k < i < n or one gets a 
non-trivial factor of the integer q. 

As mentioned before, the proof of this theorem is analogous to the proof of the 
security of every RSA bit 0. See the full version 0 for a proof of the special case 
of the least significant bit. Note that Theorem 0 is a corollary to Theorem 0 (1). 
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4 Sample Applications of the Hidden Number Problem 

Due to space restrictions, we only show the bitsecurity of RSA and ElGamal here. 

RSA Bits. Let /rsa = {{N, e) : N = p-q, \p\ = \q\,p, q prime, gcd(e, </)(A^)) = 1} 
be the set of all possible RSA instances, where cj>{x) denotes the Euler Totient 
Function. Given (N,e) G /rsa and x G the RSA function is defined as 

RSAjv,e(a^) := mod N. The RSA function is believed to be one-way, which is 
called the strong RSA assumption. 

Theorem 3. Under the strong RSA assumption, every bit is a hard core of the 
RSA function. 

As usual, the proof is by reducio ad absurdum. Assume that the predicate bit^ 
is not hard, i.e. that there exists an oracle Orsa that non-trivially predicts the 
i-th bit of X given RSA7v,e(2^) as input. Then this oracle can be used to construct 
an oracle pptm inverts the RSA function. 

The inversion is done by simulating the hidden number oracle On,x for the 
hidden number x, given an instance of the RSA function RSA 7 v,e(a^) = mod 
N. For the simulation we set 

On,x{c) := Orsa(c® • RSA7v,e(a;)) = C/RSA(RSA7v,e(c • a;)), (4.1) 

which is biti(ca:) in case the RSA oracle did not lie. Now we can apply Theorem 0 
(2) which shows that we can compute x or get a factor of N. In both cases we 
can compute x. This is a contradiction to the strong RSA assumption. 

ElGamal Bits. The ElGamal public key cryptosystem encrypts a message 
X G Zp given a public key by computing the pair {g^, xg°‘^). Here b is chosen 
uniformly at random from the set Z*. Decryption using the private key a is 
done by computing and then dividing to obtain the plaintext x. 

In order to break this cryptosystem one has to “invert” the ElGamal function 

Elp^3,a.6(a;) := 

in a sense that given (g“, xg°'^) one has to compute x. The ElGamal function 

is believed to be one-way, what follows from the Gomputational Difhe-Hellman 
(GDH) assumption. 

Theorem 4. Under the CDH assumption, every bit is a hard core of the ElGa- 
mal function. 

Assume again there is an ElGamal oracle Oei that non-trivially predicts the i-th 
bit of X given Elp_g_o_b(a;) := {g°' , 9 ^ , xg°'’’) . The central equation (corresponding 
to equation IHH) in the RSA case) to simulate the hidden number oracle is: 

On Ac) ■■= OEi{g‘^+\g^+\c-xg^‘^+^^^^+A = OEi{EAg,a+r,t+s{cx)), 

which is biti{cx) in case the ElGamal oracle did not lie. The integers s and 
r are due to technical reasons. They are chosen at random from the uniform 
distribution on in order to randomize the input. 
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5 Universal Hash Functions &: Hard Core Bits 

All in the literature known general constructions of hard core predicates for 
any one-way function, for example the Goldreich-Levin Bit |3], are based on 
some set of universal hash functions (UHF). The natural question, first asked 
by Naslund 0 in his paper “Universal Hash Functions & Hard Core Bits” and 
later in m, was whether there is a “nice connection” between universal hash 
functions and hard core functions. We sketch a simple counterexample to this 
conjecture. 

For a prime p consider the set of UHF Sp = {s(a;) = {ux + v modp) mod 2 : 
u G Zp, V G 2Zp}. It follows from [3| that Sp gives a hard core predicate for any 
one-way function. Applying the one-way permutation f{x) = mod p, where 
g denotes a primitive element of we construct a new set of UHF Hp = 
{h{x) = {ug^ -I- V mod p) mod 2}. But the derived predicate b{x, u, v) := {ug^ + 
V mod p) mod 2 is not a hard core for the (special) one-way function g(x, u, v) := 
{g^ mod p,u,v). Theorem [D gives us a set Tp = {t(a;) = (ux mod p) mod 2} 
giving hard core predicates. In the full paper we show that Tp is only “2/3- 
universal” which means it is almost universal. Together with the above result 
this shows that the concepts of UHF and hard core predicates seem to be less 
related than it was assumed previously. For a more detailed analysis of the 
relation between universal hash functions & hard core bits see jH|. 

Acknowledgment. This paper greatly benefited from the helpful insight of 
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Abstract. All solutions of Pythagorean equation (P-equation) xf + 
X 2 = X 3 in relatively free rings of varieties of n-nilpotent associative or 
associative-commutative rings (n=3,4) are described. In particular, it is 
shown that Pythagorean equation has no minimal and complete set of so- 
lutions in free rings of such varieties, so unification type of these varieties 
is nullary. This is also valid for the variety of associative-commutative 3 
(or 4)-nilpotent rings of characteristic two. 



Unification theory is concerned with solving equations in a relatively free alge- 
bras of varieties. Equational theories which are of unification type unitary or 
finitary play an important role in automated theorem proving, term rewriting, 
in logic programming with equality [1]. Investigating the unification hierarchy 
and determining unification type of equational theories is thus interesting for 
unification theory and its applications. 

We investigate unification theory in the varieties of associative rings without 
unit and in their subvarieties n-nilpotent rings (n = 3,4). For n = 2, i.e., for 
associative rings with zero multiplication, any equation in free ring of this variety 
is linear over Z. Therefore, unification type of this variety is finitary. We will 
show that unification in the varieties of n-nilpotent associative or associative- 
commutative rings (n = 3,4) is nullary. Below, ’’rings” will always mean a ring 
without unit, unless otherwise is stipulated. 

Unification theory of the varieties of associative and associative-commutative 
rings with unit have been considered by several authors. Franzen [3] proved that 
Hilbert’s Tenth Problem is of the unification type zero, i.e., variety of associative- 
commutative rings with 1 is of nullary type. Martin and Nipkov [4] proved that 
unification in varieties of the Boolean rings and primal algebras is finitary. See 
survey [1] for unification in other equational theories. 

The basic concepts of unification theory in the case of rings are as follows 
[1]. Let V be a variety of associative rings and V = IdV an ideal of identities of 
the variety V. Denote by Fv{X) and Fv{A) relatively free rings of the variety 
V with a countable set of generators X = {xn \ n < co} and A = {a„ | n < to} , 
respectively. We will consider equations in the first ring and look for their solu- 
tions in the second one. A solution to equation w = 0, w £ Fv{X), in Fv{A) (or 
in the variety V) is a homomorphism ^ : Fv{X) — >■ Fv{A) such that ^{w) = 0. 
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Let if be a system of equations in the variables Xi € X. Given two solutions 
r and cr of 17 we say that t is at least as general as cr and write r < cr if there 
exists an endomorphism a of Fy(A) such that ar = a. There is a naturally 
associated equivalence relation ~ defined by cr ~ r <1=^ a < t and t < a. 
A partial order on the equivalence classes of this relation is induced by <. A 
solution of the system S minimal in respect to this ordering, is called minimal 
(or the most general solution). We say that E is unitary if this partial order 
has the smallest element; finitary (infinitary) if the set of the minimal solutions 
is finite (infinite) and every its element lies above some minimal element. It is 
nullary if there is a solution which does not lie above any minimal one. 

Variety V is called finitary (infinitary or nullary) if there exists a system of 
equations having the corresponding type in a free ring of this variety. 

Proposition 1. Unification type of P- equation in variety of associative-commu- 
tative rings is not finitary. 

Proof Let Fcom{A) be a free ring of the variety of associative-commutative 
rings. Let us prove first that all non-trivial linear solutions to P-equation in 
Fcom{A) are triples {pf{a), qf{a), rffa)), where (p, q, r) are P-triples of integers 
and /(a) are linear Z-forms in the variables Ui G A. Let (pi(a), <^2(0), ^3(0)) 
be a linear solution of P-equation, (pi(a) G Fcom{A). Assume that pi(a) and 
P2(a) are not proportional and pf(a) -I- P2(®) = 'pK®)- Then the rank of the 
form Pi (a) -l- P2(®) Q is equal to 2, while the rank of p§(a) is equal to 1, 
which is a contradiction. In a similar way one can show that pi(a) and P3(a) 
are also proportional. 

Obviously, any such linear solution of P-equation can be obtained from solu- 
tions (poi, qai, rai). Since Fcom{A) is the ring without 1 , P-triples (poi, qa\, 
ra\) are minimal elements in the set of solutions of P-equation in this ring. 
Since these solutions are not comparable for different P-triples (p, q, r), the 
Proposition is proved. 

Remark 1. This Proposition is valid if the equation is an arbitrary quadratic 
Z-form. 

Corollary 1. Unification type of the variety of associative rings is not finitary. 

Let us now consider the unification type of nilpotent rings. Denoted by AMn 
(Afn) the varieties of n-nilpotent associative-commutative (associative) rings. Let 
Vi be the ideal of identities of the variety A/V3 and Fv^{A) a free ring of the 
variety AN z over a countable set of generators A. It is easy to prove 

Lemma 1. The general solution of P-equation in the variety ANz can be writ- 
ten as {pai-\- Rifa), qai-\-R 2 {a), rai -\- Rz{a)) , where (p, q, r) are P-triples over 
Z and i?i(a), i?2(o), Rsia) are arbitrary Z- quadratic forms in variables from A. 

To investigate the unification type of the variety ANz we will use Baader’s 
sufficient condition [ 2 ] for the unification type zero. For variety of associative 
rings, the condition can be formulated as follows. 
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Baader’s condition: Let F he a finite system of equations in Fv{X). By 
Uv{F) we denote the set of all solutions of F in the ring Fv{A). This set satisfies 
Baader’s condition if in Uv{F) there is a decreasing chain 0 i > O2 > ■■■ > On > , 

such that: (i) it has no lower bound in Uv{F); (ii) for all n > 1 and all 6 G Uv(F) 
with On > 0 there exists 0 G Uv{F) such that 0 >0 and On+i > 0 - 
The following proposition is proved in [2]: 

Proposition 2. The system of equations F is of unification type zero in a va- 
riety V ifUv{F) satisfies Baader’s condition. 

Theorem 1. P-equation is nullary in the variety AMz- 

Proof. Let us construct a satisfying Baader’s condition chain of solutions to P- 
equation in the variety A/V 3 . We choose {p, q, r) = (3, 4, 5) and consider a 
decreasing chain (Si) of solutions of P-equation in the ring Fv^{A), 0 \ > O2 P 

n+1 n+1 n+1 

• •• > 0 n> ■■■, where = (3ai-|- ^ of, 4ai-|- ^ af , 5ai-|- ^ of). Let us show that 

i—2 i—2 i—2 

this chain has no lower bound in C/i = Uvi(F), where F is P-equation. Indeed, 
suppose that there exists a lower bound 7 of the chain (Si) in Ui. According to 
Lemma 1 we can represent the element 7 in the following form: 7 = (3ai-|-i?i(a), 
4ai -I- i? 2 (a), 5oi -I- Rsid) . Now, let Fi = Fy^{A) be the Zs-algebra of reduced 
polynomials modulo 3 of the ring Fy^{A). Denote by /(a) (or simply by / ) 
the image modulo 3 of element /(a) G Fy^{A) in algebra Fi. Consider the 
decreasing chain (S 2 ), Ox > O 2 > ... > On > of solutions of P-equation in 
Fi, where On G Fy^{A) are elements of the chain (Si). Clearly, the element 
7 = (i?i(a), ai -I- i? 2 (a), 2ai -I- Rsid)) is a lower bound in Ui = C/i(mod3). of 
the chain (S 2 ). Let us now consider a sequence (S 3 ) of the first coordinates of 

_ _ _ _ m + 1 

the chain (S 2 ), 6 *n, O21,..., Omi, where Omi = Obviously, the sequence 

k=2 

(S 3 ) is nondegenerate. We can regard Omi as ^ 3 -quadratic forms in variables 
Ofc. Clearly, rank{0mi) = m. Let r = rank{Ri{a)) 1, where Rifa) is the first 
coordinate of the lower bound 7. Consider the element Oj. of the chain (S 2 ). Since 
Or > 7 ) there exists a homomorphism if : Fi — >■ Fi such that f’{Ri) = Ori- 
However, since Fi is a 3-nilpotent ring homomorphism determines linear 
transformation if' of the variables Ofc G A. Comparing the ranks of the elements 
Ori and if'{Ri) we have get contradiction. Condition (i) is proved. 

Let 0 be a solution of P-equation satisfying 0 < On for some n G N. By apply- 
ing the Lemma 1 we obtain 0 = (3/(a) -I- Li(o), 4/(a) -|- ^ 2 ( 0 ), 5/(a) -I- L^fa)), 
where /(a) is a linear Z-form and Li, L 2 , L 3 are quadratic Z-forms in variables 
Oi G A. Let Om be an element from A that does not occur in representations 
of elements 0 and via of the generators a* G A. We set 0 = 0 + (a^, a^, 
a^). Clearly, the element 0 is a solution of P-equation and we can show that 
the condition (ii) is satisfied for this element. Indeed, since 0 < On, there exists 
a homomorphism Ci : Fy^{A) — >■ Fy^{A) such that (i{0) = On- Define homo- 
morphisms (2 and C 3 G Hom{Fy.^{A), Fyj^(A)) in the following way: = o-i, 

if z yf TO, and C 2 (om) = 0, C,z{ai) = Ci(a»), if|yf to, and Cs(am) = o„+i, where 
Oi G A. It is easy to see that C 2 (^*) = 0 and Cs(0) = On-ei - The Theorem is proved. 
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Corollary 2 . Unification in the variety AA/’a is nullary. 

In a similar way one can prove that the varieties A 3 , AV 4 and A 4 have 
nullary unification type. 

Denote by AJ\fn,2 {J^n,2) the subvariety of the variety AMn {J^n) rings of 
characteristic 2 . Variety AN'n,2 {J^n,2) is a minimal subvariety of variety Ahfn 
(Afn). We will show now that the variety AVa 2 also has nullary unification 
type. Note that the rank is not invariant of quadratic form in free associative- 
commutative rings of characteristic 2. For such rings the proofs nullarity have 
to be modified 

Denote by V 4 the ideal of identities of the variety A 3 , 2 , and Fv^{A) a free 
ring of the variety AN^^2 over a countable set of generators A. The following 
statement be proved. 

Lemma 2 . Let B = ai02 -I- 0304 -I- ... -I- a2k-id2k, 0 < k < n, and D = 0402 -I- 
a3a4 + --- + a2fc_ia2fe + --- + a2ri_ia2„ he elements of the ring Fv^{A). There exists 
no homomorphism ip € Hom(Fy^(A), Fy^(A)) such that p{B) = D. 

It is easy to describe the solutions of P-equation in the variety AV 3 , 2 . 

Lemma 3 . The general solution of P-equation in the variety AV’3,2 consists of 
two types of solutions: 

(®1 + X]®2i-ia2i) ^2 + X) <^2r-lO'2r, (oi + ^2 + X ‘^2k-ld2k)) Or 

i—1 r—1 k—1 

h h h 

(Xa2*-ia2i, Xa 2 r-i 02 r, Xa2fe-i02fc), where Oi £ A and G N (i = 1 , 2 , 3 ). 

i—1 k—1 k—1 

Using the Lemma 2 and Lemma 3 one can prove, as in the Theorem 1, the 
following 

Theorem 2. P-equation is nullary in the variety AV’3,2. 

Unification type of varieties of n-nilpotent associative-commutative rings for 
n > 5 can be investigated in a similar manner. However in this case one should 
use /c-forms in variables Oj G A, for fc > 3. This case will be considered elsewhere. 

Acknowledgment. The author would like to thank G. Belitski, G. Mashevitzky 
and B.I. Plotkin for many helpful discussions. 
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Abstract. Linear cellular automata have been studied in details using 
algebraic techniques ^ . The generalization to families of polynomial cel- 
lular automata seems natural. The following step of complexity consists 
of bilinear cellular automata which study has begun with the work of 
Bartlett and Garzon [2|. Thanks to bulking techniques |5|, two-states 
bilinear intrinsically universal cellular automata are constructed. This 
result answers a question from Bartlett and Garzon |2] of 1995. 



A cellular automaton consists of a regular network, for example a line of cells, 
carrying finite values that are updated synchronously on discrete time steps 
by applying uniformly a local rule. Despite their apparent simplicity, cellular 
automata exhibit varied, sometimes complex, behaviors. 

The properties of linear algebraic objects are easier to describe than the 
properties of general objects. In the case of cellular automata, the study of linear 
cellular automata has begun with the work of Martin et al. P] . They showed that 
linear cellular automata are really simple and they completely described their 
behavior. In 1995, Bartlett and Garzon |3 studied bilinear cellular automata. 
They proved that a particular sub-family of bilinear cellular automata, cellular 
automata over with p prime, is as complex as the whole family of cellular 
automata. The question remained open whether bilinear cellular automata over 
TZjm were as complex as the whole family of cellular automata for small values 
of m. Moreover, the result from Bartlett and Garzon |2| was given thanks to a 
notion of simulation and 7r-universality that were not formally defined. 

In the spirit of Mazoyer and Rapaport 0 , we have introduced a tool, called 
geometrical bulking, to classify and prove properties on cellular automata. This 
tool is based on a notion of simulation and a notion of space-time diagrams 
rescaling. Within this scope, we formalized the notion of intrinsic universality 
implicitly introduced by Albert and Gulik II fP, which corresponds to the notion 
of TT-universality in the paper of Bartlett and Garzon P). 

In the present paper, we close an open question from Bartlett and Garzon |2j 
by constructing a two-states bilinear intrinsically universal cellular automaton. 

* a longer version of this paper is available from the author, see |Sj 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 330- 1^1 2001. 

© Springer- Verlag Berlin Heidelberg 2001 



Two-States Bilinear Intrinsically Universal Cellular Automata 397 



1 Cellular Automata and Geometrical Bulking 

In the following, we only consider one-dimensional cellular automata, that is 
straight lines of cells. We briefly recall some necessary definitions and theorems 
about cellular automata and geometrical bulking. The interested reader is invited 
to consult the longer version of this paper ^ . 

Definition 1. A cellular automaton A is a triple (S, {ni, . . . , n^} , S) such that 
S is a finite set of states, N is a finite ordered set of d integers called the neigh- 
borhood of A and 6 is the local transition function of A which maps to S. 

A configuration C of a cellular automaton A maps Z to the states set of 
A. The state of the i-th cell of C is denoted as C^. The local transition function 
5 of A is naturally extended to a global transition function which maps a 
configuration C of A to a configuration C of A satisfying, for each cell i, the 
equation — 6 . . . , 

A sub-automaton of a cellular automaton corresponds to a stable restriction 
on the states set. A cellular automaton is a sub-automaton of another cellular 
automaton if (up to a renaming of states) the space-time diagrams of the first 
one are space-time diagrams of the second one. To compare cellular automata, 
we introduce a notion of space-time diagrams rescaling. To formalize this idea, 
we introduce the following notations: 

cr^. Let S' be a finite state set and k be an integer. The shift cr^ is the bijective 
map from onto which maps a configuration C to the configuration 
C such that, for each cell i, the equation = G is satisfied, 
o™. Let S be a finite state set and m be a strictly positive integer. The packing 

77 7j/j 

map o’" is the bijective map from onto (S’") which maps a con- 
figuration C to the configuration C such that, for each cell i, the equation 
— {Cjni, • . • i) Is Satisfied. 

Definition 2. Let A be a cellular automaton with states set S. A {m,n,k)~ 
rescaling of A is a cellular automaton with states set S’" and global 

transition function o o’" o G(^ o o“’". 

Definition 3. Let A and B be two cellular automata. Then B simulates A if 
there exists a rescaling of A which is a sub-automaton of a rescaling ofB. 

This relation of simulation has good properties. In particular, in p|, we proved 
that the relation of simulation is a quasi-order with a maximal induced equiv- 
alence class exactly corresponding to the set of intrinsically universal cellular 
automata in the sense of Albert and Culik II p. As any cellular automaton 
can be simulated by a one-way cellular automaton, that is a cellular automaton 
with neighborhood {—1,0}, there exist intrinsically universal one-way cellular 
automata. Therefore, to prove that a particular family of cellular automata con- 
tains an intrinsically universal cellular automaton, it is sufficient to prove that 
any one-way cellular automaton can be simulated by a cellular automaton from 
the family. The details and the formal definitions of intrinsically universal cellu- 
lar automata are presented in the longer version |n|. 
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2 Two-States Bilinear Cellular Automata 

Bilinear cellular automata are polynomial cellular automata which polynomial 
is a bilinear functional. Bartlett and Garzon 0 proved the universality of this 
family of cellular automata in the special case where the states set is of the kind 
with p prime. We prove that the family of bilinear cellular automata with 
states set Z 2 is universal, answering their question concerning bilinear cellular 
automata with states set for small values of m. 

Definition 4. A bilinear cellular automaton is a polynomial cellular automaton 
of degree 2, that is, a cellular automaton which states set is a finite commutative 
ring and which local transition function can be represented as a polynomial only 
consisting of quadratic monomials: <5 (si, . . . , Sd) = bi,jSiSj. 

Theorem 1. Each one-way cellular automaton can be simulated by a two-states 
bilinear cellular automaton. 

Proof. Let A be a one-way cellular automaton {S, {—1, 0} , (5). Let n be the car- 
dinal of S. Up to a renaming of states, we can assume that S = {0, 1, . . . , n — 1}. 
We construct a two-states bilinear cellular automaton {Z 2 , {—r, . . . ,r} , P) 
which simulates A. The basic idea of the construction is to represent a cell 
of a configuration of A by a block of cells all but one in the state 0. The position 
of the cell with value 1 determines the state for the cell of the configuration of 
A. To encode the transition 5(i,j) = k, we build a monomial SpSq where p is the 
distance from the position of k to the position of i and q the distance from the 
position of k to the position of j as represented on Fig. ^ To avoid multiplying 
the monomial by (1 — s;) for every I between p and q, we must be sure that all 
these cells can only be 0. Eventually the mapping from (f, j, k) to (p, q) must be 
injective to avoid any “misinterpretation” of a monomial. 
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Fig. 1. The idea of a cell encoding 

A complete and motivated construction can be found in the longer version of 
this paper [Oj. Here, we only provide the technical part of the proof. We choose to 
discriminate between the encoding of cells of a configuration of A thanks to the 
parity of its position into the configuration. An even cell with value i is encoded 
as a block of cells of size 18n^ with value 1 at cell (6n -I- 2)i. An odd cell with 
value j is encoded as a block of cells of size 18n^ -|- 1 with value 1 at cell 6nj. 

First, we show that the distance between two cells with value 1 permits 
to know whether it correspond to two next encoded cells. The maximal dis- 
tance between two cells with value 1 corresponding to two next encoded cells is 
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18n^ -I- 1 -I- (6n + 2)n = 24n^ -|- 2n -I- 1. The minimal distance between two cells 
with value 1 corresponding to two encoded cells at distance three or more is 
18n^ — (6n -|- 2)n + 18 it? -I- 1 = 30n^ — 2n -|- 1. As n ^ 1, it is clear that two cells 
with value 1 correspond to two next encoded cells if and only if their distance is 
less or equal to 24n^ -|- 2n -I- 1. 

Second, we show that, when building a monomial SpSq encoding the transition 
S{i,j) = k, the mapping from {i,j,k) to (p, g) is injective. As it is straightfor- 
ward to compute, if the position of k is odd, p = 18n^ — (6n -I- 2)i -|- 6nk and 
q — 6nj — 6nk] if the position of k is even, p = 18n^ -I- 1 — 6m -|- (fin -|- 2)k and 
q = (6n -|- 2)j — (fin -|- 2)k. Being given p and q, computing p modulo 2 gives the 
parity of the position of k. Then, depending of this parity, p modulo 6n permits 
to obtain i or k. From p, we then deduce both i and k. Finally, we compute j 
from q. The mapping is injective. 

From the above computations, we obtain the following polynomial. 



n—1 n—1 



P(^S—r‘f • • • 7 EE ^18n'^ — {6n+2)i+6n5{i,j)^6nj—6nS{i,j) 

i 0 j 0 d~Sl8n^-|-l— 6ni+(6n-|-2)(5(i,j)®(6n+2)j — (6n-|-2)<5(i,j) 

The parameter r can be chosen as 36n^ -|- 1. ■ 



3 Conclusion and Open Problems 

In this paper, we have proven, thanks to geometrical bulking, the existence of 
two-states bilinear intrinsically universal cellular automata, drastically decreas- 
ing the previous known number of states (using 2 states instead of 211^^^ states 
in the paper of Bartlett and Garzon j2|). Our result naturally extends to higher 
dimensions. The difference between linear and non-linear cellular automata is 
worth studying. To continue the study of bilinear cellular automata, one has to 
find a bound on the neighborhood size for intrinsical universality (our best today 
estimation is a radius of 1297 cells). 
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Graph automata were first introduced by P. Rosenstiehl |^, under the name of 
intelligent graphs, surely because a network of finite automata is able to know 
some properties about its own structure. Hence P. Rosenstiehl created some 
algorithms that find Eulerian paths or Hamiltonian cycles in those graphs, with 
the condition that every vertex has a fixed degree Those algorithms are 
called ’’myopic” since each automaton has only the knowledge of the state of 
its immediate neighborhood. A. Wu and A. Rosenfeld (0 0 ) developed ideas 
of P. Rosenstiehl, using a simpler and more general formalism : the d-graphs. 
Hence, a graph automata is formed from synchronous finite automata exchanging 
information according to an adjacency graph. A. Wu and A. Rosenfeld gave a 
linear algorithm allowing a graph automata to know if its graph is a rectangle 
or not, then, E. Remila |3 extended this result to other geometrical structures. 

We give here a new and very general method that allows to recognize a large 
class of finite subgraphs of (and a lot of subclasses) by putting orientation 
on edges and computing coordinates for each vertex. Depending on the class of 
graphs we want to recognize, we could use different processes on the border of 
the detected geometrical structure. The difficulty of this task is due to the finite 
nature of the automaton. As it has a finite memory, an arbitrarily large number 
(coordinates) can not be stored in such a memory. Thus we have to use several 
techniques : we use time differences to compute coordinates, and we will define 
signal flows to do comparisons between coordinates. 

Proposed algorithms have a very satisfying linear or quasi-linear time com- 
plexity. Details can be found in [J- 

Definitions. Let c? be a fixed integer such that d > 2. A d-graph is a 3- 
tuple (G, vq, g), where G = {V, E) is a symmetric connected graph with only two 
kinds of vertices : vertices of degree 1 (which are called ^-vertices) and vertices 
of degree d; Uq is a d-vertex of G which is called the leader, or the general; g 
is a mapping from E to |l,2,...,d} such that, for each d-vertex v of V, the 
partial mapping g{v, .) is injective. The subgraph G' of G induced by the set of 
d-vertices is called the underlying graph of G. 

From any graph G' of degree at most d, we can construct a d-graph (G, VQ,g) 
whose underlying graph is G'. For each vertex, we call an up-edge, an edge that 
links this vertex to another vertex that is closer from the leader than itself. 

Graph Automata. A graph automaton is a set M = (Gd, A) with Gd a d- 
graph, and A a finite d-automaton (Q, 5 \ Qx ^ H> Q). A configuration 
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C of M is a mapping from the set of vertices of Gd to the set of states Q, such 
that (7(1)) = # iff u is a ^-vertex. 

We define, for each vertex v, the neighborhood vector H{v) as in We 
compute a new configuration from a previous one by applying the transition 
function 6 simultaneously to each vertex of Gd, by reading its state and the 
states of its neighbor : Gnew{v) = S{G{v),H{v),G{i'i), ...C{iyd))- 

Hence, we have a synchronous model, with local finite memory. There is a 
quiescent state go in C? : a vertex v and all its (non-^) neighbors are go, v will be 
go in the next configuration. The initial configuration is a configuration where 
all vertices are go, except the leader, which is qinit- 

To simplify our explanations, we will consider that each automaton sends 
and receives signals to/from its neighbors at each step of computation. To allow 
simultaneous exchanges, we consider a message as a set of signals. 

Recognition of 4-Graphs. Now, we will only use automata with two partic- 
ular states : the accepting state and the rejecting state. 

The class of accepted graphs for one d-automaton A is the class of d-graphs 
Gd such there is a configuration, with the leader is in the accepting state, that 
can be reached in {Gd,A). 

A recognizer is an automaton that rejects all d-graphs that it does not accept. 

So the aim of the recognition of d-graphs is to build a recognizer for a given 
class of d-graphs. We will study the case d = 4 (since many automata are already 
known on this particular topic). Before this paper, the main recognized classes 
were the finite classes and some special classes with geometrical properties : grids 
|Z], tori, cylinders, spheres or Moebius’s ribbons |3|. 

The planar grid A is the infinite symmetric graph isomorphic to We 
want to recognize classes of 4-graphs whose underlying graphs are isomorphic to 
subgraphs of A. A finite subgraph G = {V,E) of A is said square compatible if 
if is a union of elementary squares of A. If there is no vertex of V that is owned 
by exactly two non-adjacent squares, we say that G is locally square connected. 

The 2-neighborhood of a vertex v of a 4-graph G = (V, E) of A is the sub- 
graph composed by the square containing the vertex v. 

Notice that two isomorphic locally square connected subgraphs of A also are 
isometric (in an Euclidean point of view). The main tool which will be used is 
the notion of local orientation. 

An orientation O of G is a mapping from E to {North, East, South, West} 
with good obvious local properties. Each subgraph of A obviously admits an ori- 
entation, but some other graphs also do. 

Moreover, an orientation of a locally square connected subgraph of A is com- 
pletely determined by the orientations of the edges outgoing from a fixed vertex. 
This is not true for connected square compatible subgraphs of A. This fact is 
important for the determination of a possible isomorphism from a given graph 
to a subgraph of A. It is the reason why we first limit ourselves to the class of 
locally square connected graphs. 

The Algorithm. Now, we build a recognizer. The algorithm of recognition is 
divided into three steps : Orientation, Coordinates and Verification. 
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Orientation. To be able to put an orientation on each edge, 4-automata put 
indexes {x,y) € on the state of each 4-vertex of the 4-graph given. Those 
indexes will be used as coordinates in modulo 6 and canonically induce an 
orientation of edges. 

The process of orientation of edges consists in exploring the potential 2- 
neighborhood of each 4-vertex of the underlying graph and giving indexes to its 
neighbors (new indexes have to be compatible, in the sense of coordinates in 
modulo 6, with the previously given indexes). 

The process is done by successive five time steps stages. For initialization, 
the leader takes the (0,0) index. During the first stage, indexes are given to 
neighbors of the leader vertex. At the stage, indexes are given to vertices at 
distance n from the leader. 

Each stage is computed using signals exploring the 2-neighborhoods of ver- 
tices. Those signals are indexed by the index of the center of the 2-neighborhood 
explored, in order to avoid interferences between signals issued from different 
vertices. They also have an edge index, which indicates the number of the edge 
from which they leave the center. 
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Fig. 1. Exploration of a 2-neighborhood 

Coordinates of Vertices. We would like to place coordinates from on 
each vertex. As finite automata have a finite memory, we can not put coordinates 
from into it, as an arbitrarily number can be as great as we want. So we will 
encode those coordinates into the time-space diagram of the automata, using a 
classical process (0) as follows : 

First, the leader sends two different signals to all its neighbors : A signal 
Vertical and a signal Horizontal. 

Those signals will spread into the graph, by being sent by each vertex through 
its down-edges. Each time a vertex receives a signal from its up-edges, it waits one 
step time and sends the same signal through its down-edges (we say that signals 
have a speed of 1/2), except for the Vertical signal that has a speed of 1 through 
the North edges and 1/3 through the South edges, and for the Horizontal signal 
that have a speed of 1 through the East edges and 1/3 through the West edges. 
Finalization. Let G = (V,E) denote the underlying graph of the 4-graph 
given. If the processes above succeed, one can define a mapping / from V to 
such that, for each vertex v, f{v) = {uh — 2c?;, n„ — 2c?;) where nt, riy represents 
the time where the vertex receives respectively the Horizontal signal or the 
Vertical signal and d; is the distance to the leader. 

We now have to see if / is injective or not. As we have a 4-graph, with every 
edge being oriented and every vertex having coordinates. This property allows 
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Fig. 2. (Left) Space time of the coordinates process, (Right) A graph with 8 vertices 
covering 

us to say that the graph is a subset of the infinite grid 2Z^ if and only if there are 
no two vertices with the same coordinates. This is the problem of the covering. 

We give some linear time solution for specific classes, like convex figures, and 
a nlog(n) optimal solution for the general case using binary signal streams. 

Extensions. First, it can be extended to other dimensions, like the infinite 
grid 71^ , or The extension to TZ,"^ just needs to consider more possible local 
configurations,. We can also see that we do not need any property of planarity 
in our algorithm. It can be extended to any regular Euclidean network. In fact, 
we just need a notion of direction (cardinal points), and the “commutativity of 
edges” : To go from a vertex to another one, we can exchange parts of a path 
to find another valid path. This property induces a coordinates system as the 
canonical system we use on 7Z^ . 

The locally square connected property is not necessary for linear time com- 
plexity : we can use a global square connected property, with more state in the 
automaton. We must finally note that there is a trivial algorithm for non-square 
connected property in exponential time, testing all possible orientations. 
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Abstract. This paper considers the problem of distributed dynamic 
task allocation by a set of cooperative agents. There are different types 
of tasks that are dynamically arriving to a system. Each of the agents 
can satisfy the tasks with some quality (which may be zero). Every task 
is augmented by the needed qualitative level of task’s execution. Thus, 
relation between agents and task types is fuzzy. The main goal of the 
agents is to maximize the overall performance of the system and to fulfill 
the tasks as soon as possible. This problem belongs to the Distributed 
Problem Solving class of Distributed Artificial Intelligence research. The 
results differ from that for task allocation in multi-agent environments 
where each agent tries to maximize its own performance. 

The principal result of the paper is a distributed polynomial algorithm 
for determining probabilistic optimal policy for task allocation in fuzzy 
environment. 



1 Introduction 

Efficient task allocation is very important when a group of agents try to satisfy 
a set of tasks. The main problem in the assignment of the tasks is that the 
agents have different capabilities. We propose to optimize the performance of 
a set of agents that need to satisfy a set of tasks by modeling these agents 
with a stochastic closed queueing network j^.The problem is divided into two 
subproblems: to determine a distributed policy of optimal task allocation and to 
find the optimal effort levels of the agents subject to certain constraints. A policy 
for task allocation is a set of probabilities Pij that a task of type tj is allocated to 
agent a^. For the first subproblem an algorithm for determining optimal policy 
is proposed. For the second subproblem an analytical solution on the base of 
the symmetries of initial objects is given in Pj. When all the variables of the 
performance function are free, the obtained maximum is global. This provides a 
way to estimate the quality of different methods. 

2 Environment 

We model the system by a bipartite graph with the agents as left nodes and task 
types as right nodes. The edges are weighted with the value of agent’s ability 
(or quality) to fulfill a task. 
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We use here the following notations: T - set of task types, |T| = r, t - a task 
type, t G T, A - set of agents, |A| = k, a - a,n agent, a G A, : A x T — >• [0, 1]; 
rz(a, t) is interpreted as a level of ability of an agent a to execute a task t. It is 
more convenient to write where i is the number of an agent at, and j is 

the number of a task type tj . The function v distinguishes a fuzzy set p G AxT. 
The value iy{a, t) is a measure of belonging of the pair (a, t) to the subset p. 

Consider a fuzzy subset At G A for every task type t. Measure of belonging 
of an element a G A to the subset At is the function Vt defined by the rule: 
t't(a) = v{a,t)- Correspondingly, for fuzzy subset Ta G T we have measure 

Va{t) = v{a,t)- 

In order to make 6 discrete, let us divide the interval [0, 1] into 10 parts. Every 
5 G [0, 1] belongs to one and only one of these disjoint intervals. We denote by 
5* the right bound of this interval. Assume that f{5) is the distribution function 
of the variable 5. Probability that <5 G [d* — 0.1, 5*] is //._q ^ f(S) d6. 

Consider the Cartesian product T x [0, 1]. Take a finite subset of pairs s = 
in this infinite set. Such s are called generalized task types. We also write 
t{s) = t, S*{s) = (5*. Besides the fuzzy sets At and Ta let us treat also ordinary 
sets As and Sa- Denote: A° - set of all a G A such that t't(a) > 0, \A^ \ = kt = kj, 
Ta - set of alH G T such that Vait) > 0, |T°| = Ta = n. 

Define ordinary sets: Ag for every generalized type s = (t, S*) and Sa for 
every agent a. Let As = {a|iz(a, t(s)) > <5*(s) V iy(a,s) = maxa(i^(a,t(s)))}. If 
for every a we have i/(a,t(s)) < d*(s) then Ag consists of only those a for which 
iy(a, s) is maximal. If iy(a, t(s)) > d*(s) holds true for some a, then disjunction is 
true independently from the second condition. According to this definition, Ag is 
never empty and if the task of the type s is on the input, it will be served anyway. 
Let us pass to the set Sa = {s|iz(a, t(s)) > S*(s) V n(a,s) = maXs(iy(a,t(s)))}. 
The cardinality of this set is l^a] = CTo. is a set of all generalized task types 
s, which an agent a can fulfill with needed level, and if there are no such agents, 
then this set is the set of s with maximal iy(a,t(s)) for the fixed a. 



3 Methods 

We need an algorithm which calculates a probability that the arrived task will 
be allocated to an agent a. Denote this probability by Pa- The collection of such 
probabilities for all of the agents allows to solve the problem of optimization of 
performance of the system of cooperative agents and to find optimal intensities of 
the agents. Each arriving task has its type t and also a numerical characteristic d, 
i.e., a task is a pair {t, 5 ) . A task (t, S) is allocated to an agent from the set Ag, s = 
(t,6*). Thus, in order to compute the probability Pa, we need first the probability 
Pt,s that the task has characteristic (t, <5) and then conditional probability Ps^a 
that if the task has characteristic (t,6), then it is allocated to the agent a. The 
following formula determines probabilities Pa- Pt,s = Pt //._o i = Ps and 

Pa = J2sPsPs,a- 

We propose several algorithms to determine probabilities Ps,a- 
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Method 1. We introduce equivalence fig on the set Ag. It is defined by the rule: 
ai/3s02 ^ v{ai,t) = v{a2A)) where t = t{s). This equivalence partitions the set 
Ag into the set of classes of equivalent agents. All the agents of a class have the 
same abilities from the point of view of the client which issued the task of the 
generalized type s. But these agents are different over other parameters. 

The classes are sorted by the value of the function v{a, t) constituting several 
layers: the lowest layer is the class of equivalent agents from A„ with the minimal 
value of v{a,t)- Denote a layer containing an agent a by /3s (a), its number by 
nj3^{a) and its cardinality by |/3s(a)|. If a ^ Ag, then pg a = 0. Otherwise this 
agent a belongs to some layer (ig{a). All the agents in this layer are equivalent 
and thus probability to allocate the task to an agent is in inverse proportion 
to the cardinality of the layer and to the number of the layer, because we wish 
to save higher layers to the tasks with more strong conditions. The appropriate 
formula is pg^a = Cs_a(^/3s(®)l/^s(a)|)~^- In order to find weight coefficients 
we introduced the set Sa of generalized task types which can be fulfilled by an 
agent a, with the cardinality of this set a a- It is evident that the more is a a, the 
less is probability to send the task to this agent, since this agent can be involved 
also in other tasks. Hence, weight coefficients can be represented as Cg,a = 
and then pg^a = o:s(cran./3„(a)|/3s(a)|)“^. The set Ag is defined in such a way that 
it is never empty. This definition gives us opportunity to calculate normalizing 
coefficients ag, since every task will be allocated to one of the agents and thus 
all probabilities sum to one: 1 = Ps,a = otg denote 

^ “ X/aGAs CTaM/Ja (a)|/3s(o)| ■ Then ag — ^ 1 . 



p 



1 

s,a 



cTanig^(a)\t3s{a)\’ 0, € Ag, 
0, a ^ Ag. 



( 1 ) 



Method 2. Let us introduce an equivalence 7 on the set A of all agents: 01702 O 
Cai = cTaj where a a = |S'a|. Thus, we get partition of the set A, which induces 
partition of every Ag, and also partition of every layer of Ag. 

Denote by 7/3g partition of the layer ( 3 g by equivalence 7. For every a € Ag 
we have 7/3g(a), which is 7/3g layer, containing o. Denote probability 
that the task of type t will be allocated to the layer 'y( 3 g{a). This probability 
is: = ag{(TanfjX*^))~^ . The probability to allocate a task to the specific 

agent o within the layer is pg^a = as(o-an/3,(o)|/3s(a)|)“h 

Take sum over o € Ag in order to compute ag. The sum is double: first (inner) 
sum by 7 in the layer y/3g (/3g is fixed); external sum is by layers /3g. The second 
sum is due to double partitioning. 

Denote Then Og = A. Finally, 



P 



2 

s,a 



o-an,3, (a)|/3s(o)| ’ 

0, a ^ Ag. 



(2) 



Method 3. For every agent a we have denoted by Sa a set of generalized task 
types which this agent can fulfill. Let us consider an event that generalized task 
type s belongs to Sa- Probability of this event is the sum of all probabilities 
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Ps that the task is of generalized type s over all s G Sa' = ^seSaP^' 
idea of the third method is to use this probability instead of cardinality of 
the set Sa, which is quite natural: 



_ \ ^ 1 _ ^ 



' p^-ni3,{a)\(3s{a)y 

„3 ^ f ^ As, 



T)'^ = i ^ y - -/5s v-yi>^sv-yi f3') 

\0, a^As. 

Method 4. In this method we do not substitute aa in the formula for the 
probability Ps^a by but use both multipliers: 



1 



Ps,a — 



aaP^'^np^{a)\(3s{a)\’ 



= E 

Q-G-As 



1 



aaP^'^np^{a)\Ps{a)\' 



1 

ac = — r. 



„4 _ / (a)\/3„{a)\cra ’ ^ ^ ^s, 

lo, a^A,. 

Method 5. This method combines Method 2 and Method 3. 



( 4 ) 
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aaP''^‘^np^{a)\-fPs{a)\’ 



EE 

7 /3s 



aaP^“n0^{a)\-fPs{a)y 






,5 = ^ P*'“"/3s(a)l7/3s(a)ka i ® ^ 

0, a ^ Ag 



Ps,a 



1 

( 5 ) 



4 Algorithm 

We have proposed five empirical methods oips^a computation. None of the meth- 
ods is preferable since in different conditions any of them could be better than 
others. Thus, we formulate the following algorithm to find the policy of task 
allocation: (1) Compute Ps,a by all methods — >■ (2) Compute Pa using ps^a (3) 
Compute performance of the system — >■ (4) Choose the method with maximal 
performance to be our policy — >■ (5) Allocate arriving tasks according to the 
optimal probabilities. 
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Abstract. Let H 3 ~ ( Z3 ; + ) be the three-element group {a,a^,e} 
with a® = e; No =N U{0} ; !/<•> - the class of all first-order formulas of 
the signature < ■ > . For any recursively enumerable (r.e.) set A C N we 
effectively define an No x No-matrix Pk over H 3 and consider the Rees 
matrix semigroup Ck = M{H 3 , No, No,P/c). The following theorem 
presents the main result of the paper. 

Theorem. There exists a reeursive mapping N — > L<.>, giving for every 
m gN a corresponding elosed formula ipm. G L<.>, in such a way that, 
for every r.e. K C N; 
m £ K iff tfirn is valid on Ck • 

Such an interpretation of the membership problem for K in the elemen- 
tary theory T{Ck) enables us to estimate the complexity of T{Ck)', in 
particular, if A is a non-recursive set then T(Ck) is undecidable. 



Throughout this paper we use the following notation: ^ - ’’equals by defini- 
tion” ; - the class of all first-order formulas of a signature a (the signature may 

contain constant symbols, symbols for operations and for predicates); Ta{A) - 
the elementary theory of the algebraic system A of the signature a (this means 
the class of all closed formulas from that are valid on A)\ A \= ip - the for- 
mula ip is valid on A. In the terminology concerning first-order formulas and 
theories we follow |2|; in particular, the term algebraic system is equivalent 
to algebraic structure, and the term decidability is equivalent for T),(A) to 
solvability or recursivity and means the existence of an algorithm answering 
for any closed formula ip G To-, whether ip G T„{A) or not. 

In the terminology concerning completely simple semigroups we follow 
Let C ~ M{H,I,J,P) be a representation of a completely simple semigroup 
C by a Rees matrix semigroup, where H is the structure group of C, I and J 
are index sets, and P ^ (pji | j G J, i G /) is the sandwich-matrix. Then C is 
isomorphic to the set of all triples {{h,i,j) \ h G H,i G I,j G J} with binary 
operation • , defined by 



{hiAiJi) ■ (^2,*2, J2) = {hi -Ppi^ ■ h2,iij2), 
where hi -pjpj • h 2 denotes the usual product in the group H. 
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Now let 



H 3 ^{a,a^,e} with = e, N^{1,2,...}, Nq ^ N U {0}, 

and let K ^ , m{i), . . . } be a recursively enumerable (r.e.) subset of 

N. Let 



Pk ^ {Pji I j, i G No), where pji = 



e, if j = 0 or i = 0, 

a, if i > 0 and 0 < j < m(i), 

a^, if i > 0 and J > m(i). 



Let 

Ck ^ M{H3, No, No, Pk)- 

The aim of this paper is to interpret the membership problem for K in 
the elementary theory of Ck- Tbe interpretation will be done in two steps, via 
two-sorted algebraic systems I?(to,Jo) defined below . Let us note that using 
two-sorted systems of another kind helped earlier to prove the undecidability of 
the diophantine theory of a free inverse semigroup with more than one generator 
0 - 

Definition 1. Let M{H, I , J, P) be a Rees representation of a eompletely sim- 
ple semigroup C. For every in G /, jo G J let D(io,jo) be the two-sorted al- 
gebraie system with basie sets H and I x J of the signature < -, 0,71 >, where 
- denotes multiplieation on H; o denotes multiplieation on I x J, defined by 
(*i)Ji) o (* 2 ) J 2 ) = (*i)j 2 ); 7’’ is the symbol of the unary function that maps the 
set I X J into H according to the rule 

'^{{iJ)) = {PPo)~^ -Pji ■ -Pjoio- 

For formulas of the signature < -,0,77 > we will use variables of two kinds: 
j/i, 7 / 2 , • ■ ■ with the domain F[ and Li, I 2 , • • ■ with the domain I x J. Let 

A ^ {D{io,jo) I *0 G I Co G J}. 



Remark 1 . The description of isomorphisms between Rees matrix semigroups P] 
allows us to show that the class A does not depend on a particular Rees repre- 
sentation of a completely simple semigroup C. It depends only on the semigroup 
C, and we can denote it by C{C). 

The following proposition gives an exact interpretation of the membership 
problem for K in the elementary theory of the class A(Ck), for every r.e. K C 
N. 

Proposition 1. There exists a recursive mapping N — >■ L<._o,u->, giving for 
every m gN a corresponding closed formula Sm G L<.,o.7r>, in such a way that, 
for every r.e. K f- N; 

m G RT iff A{Ck) h 
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Proof. Let us fix an arbitrary r.e. set K ^ C N. Let 

P ^ Pk , and let A ^ A{Ck)- Consider the two-sorted system £>(0,0) G A. 
The function tt on £>(0,0) turns out to be as follows: 

7^((£j)) = -Pji ■ {Poz)~^ - Poo =Pji, 

because pjo = poi = e for all i,j G Nq. Now let 

neq{Y,,... ,Yr)^ /\ {Yj,^Y,J, 

col{Yu... ,Yr,y)^ /\ ((r, oyi = ri)A(7r(y,) = y)). 

l<j<r 



Lemma 1. For any m > 1 let 

7m ^ ^ e A (3^1, . . . ,r^)(neg(yi, . . . ,Ym) A col{Yi, . . . ,Ym,y)A 

{'iYm+i){col{Yi, . . . ,Ym+i,y) ->neq(Yi,. . . .F^+i)))). 

Then £>(0, 0) \= 7 m iff m £ K. 

The next lemma gives a property that specifies the system £>(0,0) in the 
class Z\. More precisely, this lemma establishes the negation of such a property. 

Lemma 2. For every fo,Jo G Nq, where io yf 0 or jo yf 0, there exist i\ G 
No \ * 0 ) ji G No \ jo such that in the matrix 

P(.io,3o) ^ ((PAo)”^ ■££ • i.Pjoi)~^ ■ Pjoio I i. * G No) 

the element of the j\-th row and i\-th column equals e. 

Note that all the elements of the lo-th column and of the jo-th row of P{io,jo) 
equal e too. 

Lemma 3. Let f be the following formula: 



(31j, . . . ,Y4){neq{Yi, ... ,> 4 ) A col{Yi,Y2,e) A co/(Y3,l4,e) A -•col{Yi,Y3,e)). 

Then £>(0,0) |= and £>(to,jo) H C for any io, jo G No such that io yf 0 or 
Jo y^ 0. 

Now for every to G N let Sm'^ ^ ((“'0 7m-) Then from Lemmas 1 and 3 

it follows that, for every r.e. £T C N: 

TO G £T iff Z\(C;c) ^6j. 

To obtain 6m it remains to delete the symbol e from SmK i-e. to replace all the 
expressions of the kind y = e by y'^ = y. Proposition 1 is proved. 

The following proposition gives an exact interpretation of £<._o, 7 r>(L\(C')) in 
T<.>{C), for every completely simple semigroup C. 
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Proposition 2. There exists a recursive mapping i<.,o, 7 r> -^<-> > giving for 

every closed formula (p € L<.,o, 7 t> « corresponding closed formula ipf G L< >, in 
such a way that, for every completely simple semigroup C : 
A{C)^piffC^pf. 

The composition of the two mappings given by Propositions 1 and 2 allows 
us to prove the main result of this paper. 

Theorem. There exists a recursive mapping N — >■ T<.>, giving for every m gN 
a corresponding closed formula ipm G T<->, in such a way that, for every recur- 
sively enumerable K C ~N: 

m € K iff iprn is valid on Ck- 



Corollary 1. The complexity of T^.^^Ck) is greater than or equal to the com- 
plexity of the membership problem for K. 

The following corollary gives an example of a completely simple semigroup 
with the three-element structure group and with undecidable elementary theory. 

Corollary 2. If K is a recursively enumerable non-recursive subset of N then 
T<->{Ck) is undecidable. 

References 

1. Clifford, A., Preston G.: The Algebraic Theory of Semigroups, Vol. 1. American 
Mathematical Society, Providence, R.I. (1964) 

2. A. I. Mal’cev, A.: Algebraic Systems. Berlin (1973) 

3. Petrich, M., Reilly, N.: Completely Regular Semigroups. Canadian Mathematical 
Society, Series of Monographs and Advanced Texts 23, A Willey - Interscience 
Publication, New York (1999) 

4. Rozenblat, B.: Diophantine theories of free inverse semigroups. Sibirsk. Math. Zh.26 
(1985) 101-107. English transL: Siberian Math. J., Consultants Bureau, New- York 
- London (1985) 




Quantum Real - Time Turing Machine 



Oksana Scegulnaja* 



Institute of Mathematics and Computer Science, 
University of Latvia, Raina bulv. 29, Riga, Latvia, 
sd60032@lanet . Iv 



Abstract. The principles of quantum computation differ from the prin- 
ciples of classical computation very much. Quantum analogues to the 
basic constructions of the classical computation theory, such as Turing 
machine or finite 1-way and 2-ways automata, do not generalize deter- 
ministic ones. Their capabilities are incomparable. The aim of this paper 
is to introduce a quantum counterpart for real - time Turing machine. 
The recognition of a special kind of language, that can’t be recognized 
by a deterministic real - time Turing machine, is shown. 



1 Introduction 

The quantum mechanism gives us a certain kind of power, which cannot be 
achieved by the deterministic or probabilistic approach. Richard Feynman [Q 
supposed that it might require exponential time to simulate quantum mechanical 
processes on classical computers. It was in 1985, when D. Deutsch introduced 
the notion of quantum Turing machine Pj and proved that quantum Turing 
machines compute the same recursive functions as classical deterministic Turing 
machines do. But it will be proved, that it is possible to find quantum Turing 
machine advantages over deterministic, that have some limitations, such as time 
or tape space. This paper shows, that there is a language that is accepted by a 
quantum real - time Turing machine, but can’t be accepted by a deterministic 
real - time Turing machine. 



2 Preliminaries 

The model of the quantum computing will be described here to introduce the 
notation used further. To get more information on the specific topic please refer 
to PI- The unit of quantum information is the quantum bit or qubit. For a qubit 
the possibility to be 1 or 0 is stated as ||a||^ + ||6||^ = 1, where a and b are the 
arbitrary complex numbers. If we observe qubit, we get true with probability 
||a||^ and false with probability ||6||^. 
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We consider quantum systems with m basis states |<7i), \q2),..., |< 7 m)- Let y 
be a linear combination of them with complex coefficients 

Ip = ai |gi) + 02 1 ^ 2 ) + + am \q2) ■ 

The norm of ip is 




The state of quantum system can be any %p with Hr/jH = 1 . ip is called a super- 
position of |(7i), |(?2)v> ai,a2, are called amplitudes of |gi), |g2)v> 

l^m) ■ 

There are two types of transformations that can be performed on a quantum 
system. First, there are unitary transformations. A unitary transformation is a 
linear transformation U that preserves norm (any ip with ||'!/)|| = 1 is mapped to 
Ip' with ll' 0 'll = 1 ). 

Second, there are measurements. The simplest measurement is observing xp = 
ai |gi) -I- 02 1^2) + + am 1^2) in the basis \qi), \q2),—, \qm)- It gives \q^) with 

probability of. After the measurement, the state of the system changes to \qi). 

3 Definitions 

Definition 1 . Real - time deterministic Turing machine (TM) is a set M = 
{E, S^,Q,qQ,qf, I) , where E- finite alphabet (of input symbols), including sym- 
bols # and $, E„, - finite alphabet (work tape symbols), including symbol X, 
Q - set of states, qo - initial state, qf - final state, { — — ,t} - move- 
ments of the head (left, right, stop), I - set of instructions. Instruction is a 
row E * E.u,*Q — Q * E„u * { — < — , t}- 

Such a machine has one endless input tape and one endless work tape with one 
head moving on each tape. At the beginning machine is in the state qo, the input 
tape head is on the first symbol of the word from the left. Work tape is empty 
and TM reads the first symbol of the word from the input tape. As a second step, 
it reads the second symbol, etc. After the last symbol it reads symbols $. Let 
it be the instruction of I: xyqk — > qjz — >, where current state is qk G Q, the 
machine is reading symbol x G E and the work tape head is observing symbol 
y G Eyj. Then the machine moves to the state qj, replacing the symbol y with 
z, and moves to the right. 

Real - time TM every moment reads a new symbol. When the symbol $ has 
been read, the work is finished. Real - time TM accepts the word, if the work is 
finished, the working tape contains one symbol ” 1 ”, the rest of the tape is filled 
with ” A” and the head observes the symbol ” 1 ” . It rejects the word, if the work is 
finished, the working tape contains one symbol ” 0 ” , the rest of the tape is filled 
with ” A” and the head observes the symbol ” 0 ” . Real - time TM recognizes the 
language L in time t(x), if for every word x G L exists a set of instructions from 
I, that needs not more than t(x) steps to obtain the result 1 , and there isn’t any 
word X G L that a set of instructions from I for x leads to the result 1 . 
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Definition 2. Real - time quantum Turing maehine (QTM) is a set M = {E, 
^w, Q, qo, qf, S), where S - finite alphabet (of input symbols), ineluding symbols 
# and $, Sm - finite alphabet (work tape symbols), including symbol X, Q - 
set of states, qo - initial state, qp - final state, transition amplitude mapping 
S : Q * E * Euj * E^ * Q * { — — ,f} — > ^[0, 1] is required to be such, that 
quantum evolution of M is unitary. That means that quantum evolutions of M 
can be defined as unitary matrices U, where UU* = I and U* is a conjugate 
transpose of U, i.e. the transposition of U and conjugation of its elements, and 
I is the unit matrix. 

To be quantum Turing machine, it has to meet so-called well-formedness condi- 
tions (see more about unitarity conditions in ^j). 

After each step the measurement is performed. 

Let it be 5 : qkxyzqj — > — > where current state is qk G Q, the machine 

is reading symbol x G E and the head is observing symbol y € Ew. Then the 
machine moves to the state qj, replacing the symbol y with z, and moves to the 
right. Real - time QTM every moment reads a new symbol. The moment, when 
the symbol $ has been read, the work is finished. Real - time QTM recognizes the 
language L with amplitude Z\(Z\) |), if M working on any word x with amplitude 
not less than A accepts x, if cc € L, and rejects x, if a; ^ L. 

Theorem 1. There is a language L, that can be recognized by a real - time 
QTM, but can’t be recognized by a real - time deterministic TM. 

Proof Let L = where x, y = {0,1}*. 

First, we’ll prove, that L can be recognized by a real - time QTM and second, 
we’ll prove, that L can’t be recognized by real - time deterministic TM. 

The first step is to split the process into three states, one of which is rejecting 
state qrr, and qi and q 2 compare x and a;’"®" and y and y’'®” respectively. 

Than the word belongs to the language L, if both the branches qi and q 2 say 
”yes”, and doesn’t, if any of the branches says ”no”. 

qo - initial state, {qa,qaa} ~ a set of accepting states, {qr,qrr} ~ a set of 
rejecting states. 

Here is a transition function of the QTM. Each vector describes a transition, 
where the first symbol is that one that is read from the input tape, and the 
second one is read from the work tape. Symbol ”e” means, that it can be any 
symbol. Transitions, that don’t change the state of QTM, are not shown. 
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All the other transitions are arbitrary such that the transformations are unitary 
(see more in j^j). 

Then the first branch moves like that: 

qi - reads x and writes it down to the work tape, moving to the right; - 
waits while y is read; q^ - reads and goes to the left the work tape, comparing 
X and cc’'®"; q^ - waits while is read. 

The second branch: 

q 2 - waits while x is read; q^ - reads y and writes it down to the work tape, 
moving to the right; q^ - waits while cc’’®’' is read; q^ - reads y’’®’' and goes to the 
left the work tape, comparing y and y®®^. 

If the branch finds the difference between x and a;’'®^ or y and y’’®’', then it 
goes to one of the rejecting states. Otherwise after the symbol $ is read it goes 
to one of the accepting states. 

If going backward the work tape the symbol read from the work tape doesn’t 
match the symbol read from the input tape, QTM goes to the rejecting states. 

When the symbol & is read from the input tape, the QTM changes its state. 
When the symbol $ is read from the input tape, the QTM finishes work. Then 
the working states go to the accepting states. 

So if the word is from L, then both of the branches say ”yes” with probability 
2/3; If the word is not from L, then it is rejected by at least one branch, and in 
total the word is rejected with probability >= 2/3. 

Language L can’t be recognized by a deterministic TM in time |r<;|, that is 
the length of the word w € L (see the standard proves Q, m, iHi). 

4 Conclusion 

We have proved the existence of the language class, that can be recognized by 
real - time QTM and can’t be recognized by real - time TM. That means that 
QTM with limitations can be more powerful than deterministic one. 
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Abstract. This paper concerns issues related to building mathematical 
models and optimal algorithms of stacks and queues fP control in single- 
and two-level memory. These models were constructed as 1, 2 and 3- 
dimensional random walks. In our opinion the algorithms, constructed 
for concrete data structure, will work better, than universal replacement 
algorithms in paging virtual and cash-memory. It was confirmed by the 
practice of construction of stack computers [21. This research work was 
supported by the Russian Foundation for Fundamental Research, grant 
01-01-00113. 

1 Introduction 

In the case of single-level memory, several methods of stacks and queues [Q 
presentation in memory may be used. The connected presentation is the first 
method. In this case any number of lists can coexist inside a shared memory area 
until the free memory list is exhausted. But on the other hand, this approach 
requires an additional link field for each element. The second method (Garvic’s 
algorithm) uses the consecutive allocation of one list after another. 

It is also possible to allocate stacks consecutively and divide them into pairs 
of stacks growing towards each other |3| . In the case of queues Knuth determined 
that operating two queues in common memory is impossible Q(ex. 2.2.2, N 17), 
however, he does not seem to consider the possibility of the travel of two queues 
one after the other around a circle. This is the case we analyze here, as in it 
has been shown that this method of stacks control is optimal. 

2 On the Problem of Optimal Stack Control 

2.1 The Optimal Allocation of n Stacks in Single-Level Memory 

Let we have a memory of m > 0 conditional units and there are n stacks divided 
into pairs of stacks growing towards each other. Let qi^Pi denote the probabilities 
of deletion and insertion information into the i-th stack. Deletion from empty 
stack with probability Qi means that process remain on the same state. The 
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process starts from the state, when stacks are empty. Our task is to determine 
the optimal initial memory distribution and optimal memory redistribution in 
case one of the stacks overflows, when optimality is understood in the sense of 
maximizing the average functioning time until stack overflow. More exactly the 
problem will be stated for n=3. 

2.2 The Optimal Allocation of Three Stacks in Single-Level 
Memory 

Suppose there are three stacks located in a memory area of volume m units. The 
pair of stacks growing towards each other occupies s memory units and m — s 
memory units are left for the third stack. Let xi, X 2 , X 3 denote the current stack 
heights. In this case the mathematical model is the three-dimensional random 
walk inside a prism with three reflecting barriers x± = —1, X 2 = —1, X 3 = — 1 
and two absorbing barriers x\ + X 2 = s + 1, X 3 = m — s + 1. The problem of 
finding optimal initial memory distribution consists in determining the value of 
s and numbering of the stacks (i.e. determining the number of the stack to be 
placed separately from the others), to maximize the average time of walk inside 
the prism until the absorption in its border, provided the process begins from 
the origin. In other words this task is reduced to choosing an optimal prism. 

The case of three stacks, when only insertions are assumed. In this case 
we have the probability of moving out of the position (xi,X2, X3) to the position 
{x\ + 1 ,X 2 ,X 3 ) — pi, to the position {x\,X 2 + — P 2 , and to the position 

(xi, a; 2 , 2:3 -I- 1) — P 3 where pi + P2 + Pz = 1- Let x = x\ + X2, y = X3, p = 
Pi +P2, q = Pz- 

Then the 2-dimensional random walk in the integer lattice space, where 0 < 
x<s,0<y<m — s is used as a mathematical model. The probability of 
moving out of the position (x, y) to the position (x-l- 1, y) — is p, to the position 
{x,y + 1) is q — I — p. The process starts from the state (0, 0) and is absorbed 
at the lines x = s -I- 1 and y = m — s + 1. The objective of our study is to And 
the value s, where the mean time of walk to the absorption would be maximal. 

2.3 The Optimal Allocation of Two Stacks in Two-Level Memory 

Let two stacks grow and collide inside a shared memory of volume m. 

D. Knuth ^ posed the problem to construct the mathematical model of this 
process. In 0-C3 the mathematical model of the process has been constructed 
as the two-dimensional random walk in a triangle with two reflecting boundaries 
and one absorbing boundary. 

In order to solve the problem, the Markov chain theory was used. 

In j01-|II]| the asymptotic behavior of stack sizes at the instant of the overflow, 
as well as the time till the memory overflow were investigated. 

In the present paper we consider a generalization of Knuth’s problem. In the 
case of two-level memory the problem is reduced to choosing optimal initial state 
of the 2 -dimensional random walk inside the triangle until the absorption in its 
border. 
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2.4 The Optimal Management of Three Stacks in Two-Level 
Memory 

In the case of two-level memory the problem is reduced to choosing optimal initial 
state of the 3-dimensional random walk inside the prism until the absorption in 
its border 

3 On the Problem of Optimal Queue Control 

3.1 The Optimal Allocation of Single Queue in Two-Level Memory 

Suppose there is cyclical consecutive queue in a memory area of volume n 
units. We assume, that in the case of memory overflow the process does not ter- 
minate. Instead, swapping with the second level memory occurs, and beginning 
of queue in fast memory transfer into state yo, and (n — yo) last elements swap 
to second-level memory. 

Let y-denote the current height of beginning of queue in fast memory, x- 
denote current height of new last elements in fast memory, p- denote the prob- 
ability of insertion information, q = 1 — p- denote the probability of deletion 
information from queue, m = n — 1- denote the maximal number of elements 
in the queue. Then the 2- dimensional random walk in the integer lattice space, 
where x>0,2/>0,a:-|-?/<mis used as mathematical model. The probability 
of moving out of the position (x, y) to the position (a; -I- 1, y) is p, to the position 
(a;, y — 1) is q = 1 — p. The process starts from the state (0, yo) and is absorbed 
at the lines x -I- y = m -I- 1 and y = — 1 . 

The objective of our study is to find the state yo, where the mean time of 
walk to the absorption would be maximal. 

3.2 The Optimal Allocation of Two Queues in Single-Level Memory 

Here we consider the problem of control over several queues in the memory of the 
same level. Assume that we want to work with k cyclic queues in the memory of 
the size n. We can arrange the operation of the queues as follows: to separate an 
individual area in the memory for each queue or to separate a common area for 
some (probably, for all ) queues and to operate inside the shared pieces so that 
the queues should travel one after the other around a circle. In any arrangement 
the following problem emerges: how much memory should be separated to each 
queue depending on its probabilistic properties? And, of course, it is necessary 
to answer the main question : which of the arrangements would have greater 
mean time to overload? 

The model of the process is made up for the case k=2. Let us denote the prob- 
abilities of insertion and deletion of the elements in the queues by pi, gi,p 2 , y 2 - 
When the queues are stored individually, let us denote current lengths of 
queues by x and y, the length of memory separated to the first queue by s. Then, 
we’ll have, as a model, two-dimensional walk over integer lattice in the region 
0<x<s,0<y<n — s with the corresponding probabilities of transitions. 
The walk begins in the point x = y = 0, here the lines x = —1 and y = — 1 
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are the reflecting barriers, as well as the lines x = s and y = n — s - absorbing 
barriers. It is proposed to choose s so that the expectation value of the time to 
absorption would be maximum. It is the optimal value that must be used for 
the comparison with the case of shared memory. 

In the case of shared storage of queues let us denote current lengths of queues 
by X and y, and the distance from the end of the first queue to the beginning of 
the second one by 2 :. Then we’ll have, as a model, 3-dimensional walk over the 
integer lattice in the region 0 < x,0 < y,0 < z,x + y + z < n. Being in the state 
(x, y, z) where 0 < x, 0 < y , the process transfers into the state {x + l,y, z — 1) 
with the probability pi, into the state (x — l,y,z) with the probability qi, into 
the state (x,y -I- l,z) with the probability p 2 , into the state {x,y — l,z + 1) 
with the probability < 72 - In the state (0,y, z) and (x, 0, z) the walk remains on 
the spot with the probabilities qi and ( 72 , respectively. The planes z = — 1 and 
x-|-y-l-z = n-|-l are the absorbing barriers. It is proposed to And the values 
0 < zo < n, so that at the exit from the state (0, 0, zq) the expectation value of 
the time to absorption would be maximum. 

The tasks are solved with the help of the apparatus of absorbing Markov 
chains as well as the construction and solution of difference equations. The al- 
gorithms as well as computational programs in C-|— I- for this problems were 
developed. In the report the results of numerical experiments have been anal- 
ysed. 
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Abstract. The recognizable subsets of free monoid semirings are 
investigated. They are defined using linear automata as recognizers. For 
a subset of a free monoid semiring, we define the Myhill and Nerode 
congruences and prove a generalization of the well-known Myhill-Nerode 
Theorem. We consider recognizable subsets of the semiring of natural 
numbers. It is shown that every finite subset of and every ideal of No 
are recognizable. Examples of recognizable subsets of free semirings are 
also given. 

Keywords: dynamical (linear) system, recognizable set, semiring. 



In 1967 Elinberg considered reconizable subsets in arbitrary monoids. Note that 
in this case recognizable and rational subsets do not necessarily coinside. By 
analogy with monoids various concepts of recognizability have been proposed for 
other algebraic structures. Recognizable subsets were defined in general algebra 
by Mezei and Wright in terms of finite congruences (see 21) • ^.n equivalent 
definition suggesting the idea of finite algebras as recognizers was given by Stein- 
by|S|. On the other hand, dynamical (or linear) systems over semirings have been 
actively investigated for years (we refer to and |^). Therefore it is of interest 
to study languages that can be recognized by input semimodules of such systems, 
which are called linear automata. 

In this paper we consider recognizable subsets of free monoid semirings and 
show that they can be defined using linear automata as recognizers. For a subset 
of a free monoid semiring, we define the Myhill and Nerode congruences and 
prove a generalization of the well-known Myhill-Nerode Theorem. It follows that 
in case of free monoid semirings our definition of recognizability is equivalent to 
the definition by Mezei and Wright and to the definition by Steinby. Finally we 
consider recognizable subsets of the semiring of natural numbers. In particular, 
we show that every finite subset of and every ideal of Nq are recognizable. We 
also give examples of recognizable subsets of free semirings. 

We refer to |5| for background concepts on semirings. Throughout, we con- 
sider semirings with neutral element 0 and identity 1. A right semimodule over 
a semiring S (or right S'-semimodule) is defined as an additive semigroup (A, +) 
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in which for all a G A, s G S' an action as G A is given so that the following con- 
ditions hold: a(rs) = (ar)s, al = a, aO = Os = 0, (a + b)s = as + bs, a{r -b s) = 
ar + as for all a,b G A and r, s G S. Left S-semimodules are defined dually. 
A semiring R which has also a structure of an S-semimodule is called an S- 
semialgebra. 

Let S be a semiring, and let Mods be a category of right semimodules over 
S. Every system (A,X,o) = (A, AT) is called a linear automaton (in the variety 
Mods), where 

• A G Mods is a semimodule of states; 

• AT is an alphabet of input signals; 

• o:AxA'— lAis the transition function; for every x G X, the mapping 

a I— >■ a o a; is an S-homomorphism. 

For a monoid M , the definition of the monoid semiring S[M] is similar to 
that of a monoid ring. In particular, given a set X, the free monoid semiring 
5'[X*] consists of all polynomials over S in non-commuting variables of X. 

Given a linear automaton (A,X, o), where A G Mods, one can extend it to 
a linear automaton (A, SX* , o) over the monoid semiring SX* by extending the 
transition and output functions as follows: 

a o {siUi + S 2 U 2 ) = asi o ui -I- as 2 o U 2 - 

We shall investigate recognizable subsets in the monoid semiring S'[A1*] mo- 
tivated by the following 

Proposition 1. The monoid semiring S'[A'*] is a free S -semialgebra over X, 
or, in other words, it is a free semiring over X in the subvariety of all semirings 
in Mods. In particular, the monoid semiring Nq[W*] is a free semiring over X 
in the variety of all semirings. □ 

A linear recognizer (A,ao,F) in the variety Mods consists of a linear au- 
tomaton (A, AT*) in Mods, an initial state Oq G A and a set F C A of final 
states. An linear recognizer (A, ao,F) is finite if A is finite. The set recognized 
by a linear recognizer (A, ao, F) is 

L(A) = {u G S[X*] \ao-uGF}. 

Further, we present some algebraic characterizations of recognizable subsets 
in FiAT*]. 

For arbitrary subset L C ^[Ar*], define the following equivalence relations on 
FiAT*]: 

u 9 l v ii and only if (Va G Vw, rc' G AT*, Vs G S) 

[wusw' -b a G L <t4> wvsw' -b a G L] 
u pl V ii and only if (Va G S'[X*],Vw G X* ,'is G S) 

[usw -b a G L vsw -b a G L] . 
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Proposition 2. 9l is the greatest congruence on 5'[X*] which saturates L. 

Proof. Let us prove, for example, that is stable with respect to multiplication. 
Consider an arbitrary nonzero element c = riCi + • • • + r„c„ G where 

ri G S,Ci G X*. We are going to show that (u,v) G 9l implies (uc,vc) G 9l- 

Take arbitrary elements s G S, a G S'[X*] and w, w' G X* . Then 

w{uc)sw' + a = wurisciw' + {wur 2 SC 2 w' + • • • + wurnSCnw' + a) G L 
wvrisciw' + {wur 2 SC 2 w' + • • • + wurnSCnw' + a) 

= wur 2 SC 2 w' + (wvrisciw' + • • • + wurnSCnw' + a) G L 

wvr 2 SC 2 w' + (wvrisciw' + • • • + wurnSCnw' + a) G L 

<t4> . . . 

wvrnSCnw' + {wvrn-iSCn-iw' + • • • + WVriSCiw' + a) = w{vc)w' G L. 

Similarly, (uc,vc), {u + c,v + c) G 9l- Thus 9^ is a congruence. It is clear that 
every congruence that saturates L is contained in 9 l- □ 

The following proposition can be proved similarly. 

Proposition 3. pi^ is the greatest right congruence on 5'[X*] which saturates 
L. □ 

Proposition 4. {S[X*]/ p]^,X*) is a minimal linear recognizer of L. □ 

Given a linear automaton (A,X) in A, we have a natural homomorphism p 
from the semirings S'[X*] to the semiring of all S'-homomorphisms of A defined 
by 

ap{u) = au, for u G SP, a G A. 

The semiring S'[X*]/ker/i = /x(S'[X*]) is called the semiring of transitions of 

(AX). 

Proposition 5. S[X*]/9l is isomorphic to the semiring of all transitions of the 
linear automaton {S[X*]/p]^,X*). □ 

The following theorem is a generalization of the Myhill-Nerode Theorem and 
gives a few characterizations of recognizable sets. 

Theorem 1. For arbitrary subset L C the following conditions are equiv- 

alent: 

(1) L is recognized by a finite linear recognizer (A,ao,F) in the variety A; 

(2) L is saturated by a congruence of S[X*] of a finite index; 

(3) 9l is of a finite index; 

(4) L is saturated by a right congruence of S[X*] of a finite index; 

(5) pl is of a finite index; 

(6) there exist a finite semiring A, a homomorphism 4 >: S'[X*] — >■ A and a subset 

F Q A such that L = F(j)~^. □ 
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Theorem Q] allows us to use facts about recognizable subsets in the sense 
of the definition mentioned in condition (6). Namely, let A be any algebra of 
signature A'. The set Rec(A) of recognizable subsets of A is defined as follows 
(see jH|). A subset L of A belongs to Rec(A) if and only if there exist a finite 
A-algebra B, a homomorphism of A-algebras (f>: A ^ B and a subset F C B 
such that L = F<p~^ . 

In the case of the semiring Nq, 

Sl = Pl = {(tt, v) G No X No \ u + b G L V + b G G No}. 

Proposition 6. Every finite subset of the semiring No is recognizable. 

Proof. Let L be a finite subset of Nq, and let a be the greatest element of L. 
Then all elements of No greater than a are in the same class of 9l. Therefore 6l 
is of a finite index, and so L is recognizable. □ 

Theorem 2. Every ideal of the semiring No is a recognizable subset. 

Proof. First, we claim that very principal ideal of the semiring No is recognizable. 
Indeed, consider an arbitrary principal ideal oNo. We are going to show that 
every two elements of No that have the same residue modulo c are in the same 
class of 0aNo ■ Take any two elements u = aui + p and v = avi + p. Assume that 
u+b G oNo, for some b G Nq. We have u+b = aui+p+b = ak, for some k. Hence 
ak — aui = p+b> 0, and therefore v + b = av\ +p+b = a{vi + k — u\) G oNq. 
We get (m,u) G 0aNo- Thus Oa^o has at most a classes, and so the ideal oNq is 
recognizable. 

Further, take an arbitrary ideal I of No. It was proved in ^ that for every 
ideal / of No there exists a finite subset A of No such that / U A is a principal 
ideal of No. Obviously, we can choose a set A of this kind so that / fl A = 0. 
Since the set /U A is a principal ideal, it is recognizable in view of the preceding 
claim. By Proposition^ the set A is recognizable, too. Since both the sets / U A 
and A are recognizable and / fl A = 0, it follows from the results of |n| that the 
set / = (/ U A) \ A is recognizable. □ 

Corollary 1. Let Li = {a; G Nq | a; < a}, for some a G N, and let L 2 be a 
recognizable language in X* . Then the subsemigroup L of (Nq[W*],+) generated 
by L 1 L 2 is a recognizable set. □ 
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Abstract. A description of the property of monotonicity of Boolean 
functions using propositional calculus is presented, which allows to use 
#SAT algorithms for computing Dedekind numbers. Using an obvious 
modihcation of Davis-Putnam satisfiability algorithm. Dedekind num- 
bers until the seventh have been calculated. Standard arithmetization of 
propositional logic allows to deduce Kisielewicz’s formula in more trans- 
parent way. 

1 Dedekind Numbers 

Function /: {0,1}" — >■ {0,1} is a Boolean function of n variables. Let a,/3 G 
{0, 1}". Let a ^ P = 3i [ai < Pi Sz Vj ((j z) — >■ {aj = Pj))]. By we denote 

the transitive closure of the relation Function / : {0, 1}" — >■ {0, 1} is monotone, 
if /(oi,... ,a„) < f{Pi,... ,/3„) whenever (ay... , a„) {Pi,... , Pn) 

Problem: How many different monotone n-variable Boolean functions there 
exist? 

The problem of finding the number, D{n), was suggested by Dedekind (|2|)- 
Within more then 100 years Dedekind numbers up to 8 were calculated. Although 
A. Kisielewicz at 1988 (0) found a closed formula for D{n): 



where = [A:/2®] — 2 [fc/2*+^] , it does not help very much in computing D{n). 
For n = 6 we must sum 2^'^ values which is out of limits of the fastest computers. 

2 Logical Description of Boolean Function Classes 

Our goal is to describe the notion of monotonicity using family of proposi- 
tional formulae. Every Boolean function of n variables can be represented by 
its truth-table, so we use 2" propositional variables xq, . . . ,X 2 ^-i, one variable 
for each entry of the truth-table. For convenience we suppose that indexes of 

* Supported in part by Estonian Science Foundation grant no. 3056 
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the variables are binary numbers. The correspondence of the Boolean function 
/: {0, 1}" — >■ {0, 1} to the bit-string /o, ■ • ■ , / 2 "-i is defined by / (a) = fa for 
every a G {0,1}". Using this encoding, we can consider propositional formu- 
lae with variables xq, . . . ,X 2 "-i as descriptions of the properties of n- variable 
Boolean functions. 

Theorem 1. Function f: {0,1}" — >■ {0,1} with truth-table fo,--- ,/ 2 "-i is 
monotone if and only if truth assignment fo,... ,/ 2 "-i satisfies propositional 
formula 



(xq, . . . ,X2^ — l) — (Xa V Xjg) . (2.2) 

a,/3G{0,l}";a-;/3 

We omit the proof, which is similar to the proof of Theorem|21 Another possi- 
bility to get Dedekind numbers is by counting anti-chains. Function /: {0, 1}" — ?> 
{0,1} is anti-chain, if f(a\,... , a„) = 0 or f {f3\, . . . , Pn) = 0, whenever 
(tti, . . . , a„) (/?!, . . . , (3n). One can establish a bijection between anti-chains 

and monotone functions by considering anti-chain as a collection of “minimal 
ones” (or alternatively as a collection of “maximal zeros”) of a monotone func- 
tion. 

Theorem 2. Function f : {0,1}" — >■ {0,1} with truth-table fo,... ,/ 2 "-i is 
anti-chain if and only if truth assignment fo, . . . ,/ 2 "-i satisfies propositional 
formula 



An {xo,... ,X 2 ^-l) = f\ (Xa^/Xfs) . (2.3) 

Of , /9 G { 0 , 1 } ; a -< + /3 

Proof. 1. Let / be anti-chain. By definition of anti-chain for every a,(3 G {0, 1}", 
a /3 implies / (a) = 0 or / (/3) = 0. Suppose An{fo,.-. ,/ 2 "-i) = false. 
Then at least one of the clauses of An, say Xa'^xp, must be false for assignment 
fo, . ■ . , / 2 "-i- It is possible only if /„ = 1 and = 1, which means that / (a) = 
1 and / {(3) = 1. From the definition of An we know that a (3 and / is not 
anti-chain. Contradiction. 

2. Let Anifo,... ,/2"-i) = true. Suppose / is not anti-chain, then there 
exist a,P G {0,1}" such that a. /3, but /(a) = 1 and / (/3) = 1. Then 
the clause Xa V xp is false and the assignment {fo,... ,/ 2 "-i) violates An. 
Contradiction. 

3 Arithmetization 

Standard method for representing propositional formula A{x\,... ,Xn) as a 
polynomial [A{xi, . . . ,Xn)] consists of two steps: 

Step 1. Transform propositional formula into equivalent formula, which con- 
tains Boolean operators A, — >■, -■ only. 

Step 2. Change Boolean operators into arithmetical ones: 
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[BAC] [B] ■ [C], [B ^ C] ^ 1 - [S] + [S] • [C], hS] ^ 1 - [B], 
where B and C are propositional formulae. A (cci, . . . , Xn) = [A (xi, . . . , Xn)] for 

X\j . . . , Xji G {0,1}. 

Theorem 3 ((A. Kisielewicz, 1988)). For any n>\ 

2^" / logji 

D{n ) = E n n 1 - It (1 - 

k—l 3 = 1 i— 0 \ m— 0 

where h’l = [fc/2*] — 2 [fc/2®+^] . 

Proof. We present here an alternative proof, which consists of transformation 
and arithmetization of the formula, defining anti-chain from theorem El 

An{xo,... ,X 2 ’^-l) = /\ {Xa'^X/ 3 ) . 

a,/3G{0,l}"-;a^ + /3 

The condition a f) can be removed inside as the antecedent of the impli- 
cation. 




(xo, . . . ,a;2"-i) = /\ /\ (a ^ +/?)-)■ (xa V a;/3) . 

aG{0,l}" /3G{0,1}" 

Using the fact, that a f3 = Am=i i^m < Pm) and that for every x,y G 
{0, 1}, X < y = X ^ y, we can rewrite the formula as 

An (Xo, . . . ,X2n_i) = A A A (Orri t Pm) | ^ (Xa V Xff) 

aG{0,l}" /3 g{ 0,1}" \m=l / 



Using the equivalence from propositional logic x — >■ (yVz) = -<{x Ay A z) 
and commutativity of conjunction we receive 



Ai (Xo, . . . ,X2n-l) = f\ f\ 
aG{0,l}" /3 g{0,1} 

After arithmetization we receive 



A Xp A I {eXm, t Pm) 



[An (xo, . . . ,X 2 n-l)] 



n n 

/3g{ 0,1}" aG{0,l}" 



1 - XaXp (1 - 






iPm) 



m=l 



Let bfbf_^ . . . b\b^ be the binary representation of the integer k. Then b\ = 
[k/T] - 2 [ n/2®+^] and we can use integers instead of bit-strings. 

2"-12”-l / n 

[An (xo, . . . ,X 2 n-l)] = n n ( ^ n + KMn) 

j—Q 2^0 \ m— 0 
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For every i,j (0 < i,j < 2”) , . . . b\bQ Hi - ■ ■ implies i < j, so we 

must consider only cases i < j and compute product n:=o (1 ^mHn} 

m < log2 i- 

2’^-lj-l / logs* 

[An (xo, , X 2 r^-l)] = n n U n (^ - + HnHn) 

j—1 i— 0 \ m— 0 

Bit-string f = fo . . . / 2 "-i is truth-table of some monotone function if and 
only if [An (/o) ■ ■ • ) / 2 "-i)] = 1- For computing D{n) we have to add these values 
for all bit-strings from {0, 1}^ or, otherwise, for binary representations of all 
natural numbers k from 0 to 2^ — 1, i.e. 

2^"-12”-lj-l / logji 

D{n) =Y. n n 1 It (1 - 

k—0 j—1 i—0 y m— 0 

The value D{n) does not change, if we change limits of the outer sum to 
Sfe=i because the formula does not use the most significant bit of 2^ : 

2^" 2"-lj-l / logs* 

D{n ) = E n n 1 n (1 - + ^hki) 

k—l j—1 2—0 V m—0 

4 Concluding Remarks 

We have already mentioned above, that Kisielewicz’s formula does not help us 
in computing D{n) for n > 5. Better idea might be computing the number of 
satisfying assignments of Ain or An using some method for counting solutions 
of propositional formulae. An obvious modification of Davis-Putnam algorithm 
(P) for counting satisfying assignments calculated D{7) but could not cope 
with calculating D{8). We see at least two directions for future work. Firstly, it 
would be interesting to try to reduce other combinatorial counting problems to 
#SAT. Secondly, Dedekind numbers can be considered as a benchmark for #SAT 
algorithms and this gives a good reason to develop better methods for counting 
solutions. Counting methods for restricted classes of CNF are also interesting 
because the formula An is anti-monotone, 2CNF and Horn formula. 






References 

1. Davis, M., Putnam, H.: A computing procedure for quantification theory. J. Assoc. 
Comput. Mach. 7 (1960) 201-215 

2. R. Dedekind, R.: Uber Zerlegungen von Zahlen durch ihre grossten gemeinsamen 
Teiler. Festschrift der Technischen Hochschule zu Braunschweig bei Gelegenheit der 
69 Versammlung Deutscher Naturforscher und Arzte (1897) 1-40 

3. Kisielewicz,A.: A solution of Dedekind’s problem on the number of isotone Boolean 
functions. J. reine angew. Math. 386 (1988) 139-144 



A General Method for Graph Isomorphism* 



Gabriel Valiente 

Department of Software, Technical University of Catalonia, E-08034 Barcelona 



Abstract. A general method is presented for testing graph isomor- 
phism, which exploits those sufficient conditions that define linear or- 
derings on the vertices of the graphs. The method yields simple and 
constructive, low-order polynomial graph isomorphism algorithms for 
classes of graphs which have a unique ordering, or a small (not nec- 
essarily bounded) number of different orderings. The general method is 
instantiated to several graph classes, including: interval graphs, outer- 
planar graphs, biconnected outerplanar graphs, and triconnected planar 
graphs. Although more efficient algorithms are known for isomorphism 
on some of these classes of graphs, the method can be applied to any 
class of graphs having a polynomial number of different orderings and 
an efficient algorithm for enumerating all these orderings. 



Considerable effort has been devoted over the last three decades to graph isomor- 
phism and related problems, motivated by the many practical applications but 
also by the failure of all attempts to determine the complexity of the graph iso- 
morphism problem. A comprehensive survey of the so-called graph isomorphism 
disease can be found in [3,9], while complexity aspects of the graph isomorphism 
problem are treated in much detail in [2,6,10]. 

The concept of graph isomorphism lies (explicitly or implicitly) behind al- 
most any discussion of graphs, to the extent that it can be regarded as the 
fundamental concept of graph theory. Two graphs are isomorphic if there is a 
bijective correspondence between their vertex sets which preserves and reflects 
adjacencies — that is, such that two vertices of a graph are adjacent if and only 
if the corresponding vertices of the other graph are adjacent. 

Definition 1 . Two graphs Gi = {Vi,Ei) and G2 = (V2,i?2) are isomorphic, 
denoted by Gi = G2, if there is a bijection h \ V\ ^ V2 such that {m,u} G Ei if, 
and only if, {h{u),h{v)} € E2, for all u,v € Vi. Such a bijection h is called a 
graph isomorphism between G\ and G2. 

Algorithms for testing isomorphism of general graphs are usually based either 
on invariants or on certificates [7]. An invariant is a necessary condition for 
isomorphism, a property that does not depend on the presentation (labeling) of 
a graph. A certificate is a necessary and sufficient condition for isomorphism, a 
description of a unique representative of a graph in a given isomorphism class. 

* Partially supported by the Spanish CICYT project TIC98-0949-C02-01 HEMOSS. 
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Unfortunately, no certificates have been found for general graphs which are 
not either equivalent to the straight definition of isomorphism, or at least as 
difficult to compute. Invariants, on the other hand, are useful for demonstrating 
non-isomorphism, but nothing can be concluded about two graphs which share 
an invariant. 

In some classes of graphs, though, a sufficient condition for graph isomor- 
phism can be stated which defines a unique ordering or numbering of the ver- 
tices — or a unique numbering of the edges that induces a unique numbering of 
the vertices. For a graph G = (V,E), where E C V x V, n and m will be used 
to denote the number of vertices and edges in G, respectively. 

Definition 2 . A numbering of a graph G = (U, E) is a bisection / : U — >■ 

{l,...,n}. 

When a sufficient condition for isomorphism in a given class of graphs pro- 
duces a unique numbering of the vertices of a graph, it can be easily turned into 
a necessary and sufficient condition. As a matter of fact, since such numberings 
are one-to-one functions, they define a bijection between the vertices of any two 
given graphs. Then, since the test for graph isomorphism given a vertex mapping 
takes linear time, if the sufficient condition for isomorphism in a special class of 
graphs can be tested in low-order polynomial time this yields a simple low-order 
polynomial-time method for testing isomorphism in the given class of graphs. 

Definition 3 . A numbering f of a graph G\ = {V\,Ei) together with a num- 
bering g of a graph G2 = {V2,E2) induee a one-to-one mapping h : Vi ^ V2, 
called the vertex mapping induced by f and g and defined by h{u) = v if and 
only if, f{u) = g{v), for all u € Vi and v & ¥2- A vertex mapping h : Vi ^ V2 is 
adjacency-preserving if {hfu), h(v)} € E2 for all {u, v} € Ei, and it it adjacency- 
reflecting if {u,v} € El for all {h{u),h{v)} € i?2- 

Lemma 1 . Two graphs Gi = (Ui,ifi) and G2 = (V2,if2) are isomorphic if, 
and only if, there is a vertex mapping h : Vi ^ V2 which preserves and reflects 
adjacencies. 

Proof. Follows directly from Def. 1 and Def. 3 . □ 

Proposition 1 . Two graphs Gi and G2 are isomorphic if, and only if, there is 
a numbering f of G\ and a numbering g of G2 such that the vertex mapping 
induced by f and g is adjacency-preserving and adjacency-reflecting. 

Proof (if) Immediate, (only if) Let Gi = (Vi,Ei) and G2 = (V2,E2) be two 
isomorphic graphs, let ft. : Ui — >■ V2 be an adjacency-preserving and adjacency- 
reflecting vertex mapping, let {vi, . . . ,Vn) be an arbitrary permutation of Vi, 
and let / : Vi — >■ { 1 , . . . , n} be the bijection defined by f{vi) = i for all Vi £ Vj. 
Let also g : V2 —>■ { 1 , . . . ,n} he the bijection defined by g(v) = f(h~^(v)) for all 
v € V2. Then both / and g are numberings. Moreover, since for all u £ Vi and 
V £ V2, h(u) = v if, and only if, f{u) = f{h~^{v)) = g{v), it follows that ft is the 
vertex mapping induced by / and g, which by hypothesis preserves and reflects 
adjacencies. □ 
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The previous formulation of graph isomorphism suggests a simple, general 
method for testing whether two graphs G\ and G2 are isomorphic. Given an 
arbitrary numbering / of Gi, if for some numbering g of G2, the vertex mapping 
induced by / and g preserves and reflects adjacencies, then Gi and G2 are 
isomorphic; otherwise (if for all numberings g of G2, the vertex mapping induced 
by / and g is not adjacency-preserving or it is not adjacency-reflecting) then Gi 
and G2 are not isomorphic. 

Actually, in order to systematically explore the vertex mappings induced by 
the numberings of the two graphs, cyclic rotations of the numberings need to 
be taken into account. The following algorithmic scheme encodes the general 
method for testing graph isomorphism. Let G\ and G2 be two graphs with 
ni = ri2. Upon completion, if Gi = G2, the vertex mapping h found is a graph 
isomorphism between G\ and G2. 

function isomorphic (Gi,G2,/i) 
let p be a numbering of graph G\ 
for all numberings q of graph G2 do 
for all cyclic rotations q' of q do 

let h be the vertex mapping induced by p and q' 
if h preserves and reflects adjacencies then 
return true {Gi and G2 are isomorphic} 
end if 
end for 
end for 

return false {Gi and G2 are not isomorphic} 

end function 

Given a numbering / of a graph G\ and a numbering p of a graph G2, the 
vertex mapping h induced by / and g can be computed in time linear in the 
number of nodes, h can be tested for preserving and reflecting adjacencies in 
time linear in the number of nodes, and the number of cyclic rotations of g is 
also linear in the number of nodes. Therefore, the worst-case complexity of the 
general method is 0 (a-|- 6 cn^), where 0 (a) is the cost of finding some numbering 
/ of Gi and 0 {b) is the cost of finding each of the 0 (c) numberings of G2. In 
particular, when the number of different numberings is bounded by a constant 
and each numbering can be found in linear time, the method gives a simple 
O(n^) graph isomorphism algorithm. 

Interesting instances of the graph isomorphism problem arise when dealing 
with restricted classes of graphs, because the graph isomorphism problem can 
be solved in polynomial time for some classes of graphs. It is the structure of 
those graphs what makes them polynomially solvable, which is often reflected 
in a sufficient condition for graph isomorphism, which induces a numbering. As 
a matter of fact, the general method is efficient for testing graph isomorphism 
when restricted to classes of graphs that have a unique or a small number of 
different numberings, which can be enumerated in low-order polynomial time. 
Notice that general graphs admit 0 (n!) different numberings. 
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Interval graphs have an interval representation that defines a unique ordering 
of the vertices according to interval precedence, and since the interval representa- 
tion can be found in linear time [1], the general method gives a simple algorithm 
for isomorphism of interval graphs in O(n^) time. 

Outerplanar graphs have a unique planar embedding such that all vertices 
lie on a single face, and since the unique planar embedding, which can be found 
in linear time [8], defines a unique numbering of the graph, the general method 
gives a simple algorithm for isomorphism of outerplanar graphs in O(n^) time. 

Biconnected outerplanar graphs have a unique Hamiltonian circuit, and since 
the unique Hamiltonian circuit, which can be found in linear time [8], defines a 
unique numbering of the graph, the general method gives a simple algorithm for 
isomorphism of biconnected outerplanar graphs in O(n^) time. 

Triconnected planar graphs (polyhedral graphs) have a unique Eulerian cir- 
cuit, because they have a unique combinatorial embedding in the plane, and 
since the unique Eulerian circuit, which can be found in linear time [5,11], de- 
fines a unique numbering of the graph, given by an enumeration of the vertices 
in the order in which they are visited for the first time in a traversal of the Eu- 
lerian circuit, the general method gives a simple algorithm for isomorphism of 
triconnected planar graphs in O(n^) time, whereas previous methods [5,11] take 
O(m^) = 0(n‘^) time, although more complex algorithms have been proposed [4] 
which take O(nlogn) time. 
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Abstract. We review approximability and inapproximability results for 
MIN-SUM scheduling problems and we focus on two main techniques 
for designing polynomial time approximation schemes for this class of 
problems: ratio partitioning and time partitioning. For both techniques 
we present examples which illustrate their efficient use. 



1 Introduction 

In the context of scheduling problems, we are given, in general, a set of n jobs, 
and a number m of machines. Each job has a processing time Pij on machine 

i, a release date a deadline dj and a positive weight Wj. A partial order, 
describes a precedence relation among jobs; we say that jobs are independent ii -< 
is empty. Two different scheduling models can be considered: the preemptive one, 
where the interruption of a job execution is allowed, and the non-preemptive, 
where once a job starts executed it should be completed. A schedule S is an 
assignment of jobs to machines and an associated starting time Sj for each job 

j. A schedule is feasible if at any time a machine processes at most one job and 
if i ^ j, then job j starts its execution after the completion of job i. By Cj is 
denoted the time at which job j completes its execution and by Di is denoted 
the time by which machine i complete the execution of all jobs assigned to it. 
OPT denotes the objective value of the optimum schedule. 

To refer to the several variants of scheduling problems, that can be obtained 
from the above general description, the standard three-field, o:|/3|7, notation 
scheme of Graham et al. uni is used. The first field describes the machine envi- 
ronment. A single machine environment is denoted by a = 1, (identical) parallel 
machines by a = P(m) and unrelated (parallel) machines by a = R{m); the op- 
tional m denoting a constant number of machines. The second field of the nota- 
tion describes any special conditions and constraints on the problem parameters 
(release dates, deadlines, precedence relation, processing times, multiprocessor 
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R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 432- ITOl 2001. 
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jobs, preemption). The third field of the notation describes the optimization 
criterion. According to optimization criteria, scheduling problems can be clas- 
sified into two classes: MIN-MAX criteria and MIN-SUM criteria. The most 
common MIN-MAX criterion is the completion time (makespan) of the sched- 
ule, minj; maxj Cj, and also the maximum lateness, min 2 ; maxj (Cj — dj). 
The most common MIN-SUM criterion is the total (weighted) completion time, 
mini; "WjCj, and also the total (weighted) flow time, min^; WjFj, 
where Fj = Cj — Vj . 

It is well known that only some of the simplest variants of scheduling prob- 
lems can be solved in polynomial time. For a thorough listing of polynomial 
solved variants as well as complexity results on scheduling problems the reader 
is referred to the book and a web page of Brucker . Hence, most of the recent 
research is directed towards designing approximation algorithms for the NP-hard 
variants of scheduling problems. A p-approximation algorithm computes polyno- 
mially a solution within p factor of the optimum one. In this setting we are 
interested in designing /9-approximation algorithms with p as small as possible 
as well as on finding lower bounds for p. When p is a constant, then we say that 
the approximation algorithm is a constant factor one. We say, however, that we 
have a Polynomial Time Approximation Scheme (PTAS) if we give an algorithm 
which, for any value of e, can construct an (1 -|- e)-approximation solution. If the 
time complexity of a PTAS is also polynomial in 1/e, then it is called a Fully 
PTAS (FPTAS). For lower bounds or inapproximability results, the notion of 
NP-completeness is used to disprove the existence of good approximation algo- 
rithms, unless P=NP. It is also well known that, unless P=NP, there is no a 
FPATS for strongly NP-hard problems. 

Historically, MIN-MAX scheduling problems were the first investigated in the 
approximation algorithm framework and a lot of results have been proposed after 
Graham’s seminal paper On the other hand, much less was known about the 
approximability of MIN-SUM scheduling problems until 1996. However, during 
the last five years, there was an explosive progress on the approximality of MIN- 
SUM problems, that recently led to PTASs for many of them. 

In this paper we present a brief survey of this research on MIN-SUM schedul- 
ing focusing on PTASs design techniques. In Section 2 we present the history of 
known results for constant approximation algorithms and we comment on the 
techniques used and their limitations. Next, we present two main techniques for 
designing polynomial time approximation schemes for this class of problems. In 
Section 3 we present the ratio partitioning technique and in Section 4 the time 
partitioning technique. For both cases we give examples which illustrate hoe 
these techniques may be used to derive PTASs for such problems. Examples are 
taken from HH]. 

2 History of Approximation Results and Techniques 

Although Shani ™ 1976, gave a FPTAS for Pm \ \ ^ WjCj, no other approx- 
imation result was known until 1991, when Ravi, Agrawal and Klein j'itij gave 
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an 0(lognlog^^- Wj)-approximation algorithm for l|prec| ^ u'jC'^ . Moreover, 
much less was also known about the approximability of MIN-SUM scheduling 
problems even until 1995, despite the fact that there was an extensive litera- 
ture on the polyhedral structure of these problems (especially of single machine 
problems). This fact, however, was only exploited for characterizations of poly- 
nomial solvable variants and for computing optimum solutions (see Queyranne 
and Schulz for a thorough survey of this research area) . 

Philips, Stein and Wein m in 1995, gave the first constant approximation 
algorithms: an (8 -I- e) for both l|rj,pmtn| ^ and P\rj,pmtn\J2wjCj), a 
(16 -I- e) for and a (24 -|- e) for Hall) Shmoys and 

Wein in m and its journal version by Hall, Schulz, Shmoys and Wein d) 
in 1996 and 1997, improved dramatically the known approximation factors and 
gave algorithms for many new variants. Their results, some of which are the best 
known even today (see Table 1 below), were motivated by the successful work 
done on polyhedral structure of scheduling problems and build on earlier research 
on computing near-optimum solutions for MIN-MAX scheduling problems, by 
rounding optimum solutions to linear relaxations. In view of future results, the 
most important point of their work was the introduction of an time-interval in- 
dexed LP formulation, since it was proven later very fruitful for designing PTASs 
for many MIN-SUM problems. The general idea of time indexed LP formulation 
was used in the past in several forms. Potts used decision variables 5ij , where 

Sij implies that job Ji precedes job Jj in the schedule. Dyer and Wosley P used 
variables Xjt, where Xjt = 1 means that job Ji completes its execution at time t. 
Wosley m and Queyranne m used as decision variables the completion times 
Cj of jobs themselves. Hall et al. US] adopted this last formulation to obtain 
their results for single machine problems and proposed a different, more compact 
linear time-interval indexed LP formulation: subdivide the time horizon into the 
intervals [1, 1], (1, 1 -I- e], (1 -I- e, (1 -I- e)^], . . . ,, where e is an arbitrary small posi- 
tive constant. Then, the linear program only specifies the interval in which the 
job is completed and since all completion times within an interval are within a 
(1 -I- e) factor of each other, the relative schedule within an interval will be of 
little consequence. This was the first use of time partitioning. 

After Hall et al. USE! there has been an explosion of research in this area 
with successively smaller constant factor algorithms as well as algorithms for 
new variants of MIN-SUM scheduling problems. Due to space limitations, here 
we present, in Table 1, only the best known constant approximation factors and 
summarize briefly all this research. Except a few combinatorial methods, these 
algorithms follow the general and successful relaxation method: first an opti- 
mum solution to a relaxation of the original problem is polynomially obtained, 
then this solution is rounded, either to order the jobs in time or to assign the 
jobs to machines, to obtain a near-optimum optimum solution of the original 
problem. The large body of algorithms within this method can be classified 
according two factors: the type of relaxation and the rounding technique used. 
Types of relaxations include preemptive schedule relaxations and linear and con- 
vex programming relaxations, while rounding techniques include scheduling by 
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completion times, scheduling by a-points and randomized rounding (see j,34j for 
a elegant survey on LP relaxation algorithms) . 

Although there is practical evidence (?rnT) that LP relaxations can provide 
very good approximate solutions, all relaxation based algorithms suffer from a 
major drawback: their solution is not compared directly to the solution of the 
original problem but to the solution of the relaxed one. However, there are gaps 
between the solutions of the relaxed and the original problem and any algorithm 
obtaining such a rounded solution inherits these gaps. Hall et al. Schulz 
and Skutella m and Chekuri and Motwani jZj argue on the fact that several 
relaxation methods for l\prec\'^WjCj can not improve on the best known 2- 
approximation algorithm. Moreover, Torng and Uthaisombut have shown 
that any preemptive relaxation for l|rj| ^ Cj, that uses the SRPT algorithm to 
solve the relaxed problem, can not led to an approximation factor better than 
e/(e — 1), that matches the best known factor of 1.58 |H|. It seems therefore, 
that such approaches methods can never yield a PTAS for MIN-SUM scheduling 
problems. 

Indeed, until 1998 only FPTASs, based on dynamic programming, for weakly 
NP-hard MIN-SUM problems were known (see Shani’s [23| for a FPATS for 
Pm I and Alon, Azar, Woeginger and Yadid 0, in 1998, ob- 
tained a PTAS for P \ \ Skutella and Woeginger ^ realized that last 

results implies also a PTAS for P | ^ = g| ^ E^nd then in a first step they 

generalized this to a PTAS for ^ ratios within a constant range. In a second 
step they obtained the first PTAS for a strongly NP-hard MIN-SUM problem 
(P I I ’^WjCj). Their main idea was ratio partitioning, i.e. the partitioning of 
the jobs into subsets according to their ^ ratios and such that the ratios of all 
jobs in one subset are within a constant range. Then, using the first step, near 
optimum solutions are computed for all subsets. Finally, these schedules are con- 
catenated in order of nonincreasing ratios in each machine. Subsequently, Afrati, 
Bampis, Kenyon and Mills 0 based also on ratio partitioning provided a PTAS 
for Rm I I ' However, the concatenation of schedules obtained following 

the ratio partition technique can not be applied in the presence of release dates. 
For this case the time partitioning technique was recalled by several researchers 
0 and it was proven powerful enough to yield PTASs in the presence of release 
dates. Based also on time partitioning Afrati et al. P| presented recently PTASs 
for P‘2\fiXj\J2Clj and Pm\fixj,pmtn\'^Cj and Fishkin et al. presented 
PTASs for Pm\fiXj,Tj \ ^ wjCj and Pm\sizej,rj \ ^ WjCj. 

The important point with all existing PTASs is that they rely either on 
time partitioning or on ratio partitioning technique. It seems more important 
in view of the fact that they succeeded where older techniques had failed and 
it is most likely that these techniques can be used to obtain PTASs for more 
involved scheduling variants. Moreover, all known PTASs use a combination of 
elementary and well understood combinatorial ingredients: partitioning, group- 
ing, geometric rounding, enumeration and dynamic programming. The approach 
taken, in these PTASs, is to sequence several transformations of the input prob- 
lem. Some transformations are actual changes to simplify the input, while others 
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are applied as thought experiments to the optimum solution to prove there is 
a near-optimum solution with nice structure. Each transformation potentially 
increases the objective value by only 1 -|- 0(e), that is, it produces 1 -I- 0(e) loss. 
Besides problem-variant-specific transformations, some general transformations 
are used in these techniques. Such transformations are geometric rounding and 
time stretching. 



Table 1. Summary of results (see also for a discussion on open problems). 
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2.1 Geometric Rounding and Time Stretching 

The first simplification creates a well-structured set of possible processing times 
and release dates: With 1 -|- e loss, we can assume that all processing times and 
release dates are rounded to integer powers of 1 -I- e. To prove, we change the 
values in two steps. First multiply every release date and processing time by 
1 -b e; this increases the objective by 1 -b e. Then decrease each date and time 
to the next lower integer power of 1 -b e (which is still greater than the original 
value). This can only improve things. 

The second transformation time stretching creates idle time gaps evenly 
spread along the schedule. These gaps will be used conveniently in later trans- 
formations to accommodate jobs that we are moving around in order to prove 
that there is a near-optimum schedule with nice structure. Often, we will use 
time stretching to add an idle amount of epj time units before every job j. This 
will increase each completion time (and hence their sum) by a factor of 1 -b e. 
According to this last transformation, we may also assume that the starting time 
of each job j is Sj > epj: If job j completed at time t > pj then it now completes 
at time (1 -b e)t and therefore does not start until time et > epj. 

3 Ratio Partitioning 

For the basic 1|| X] ‘^j^j variant W. Smith, in 1956, designed a very easy greedy 
algorithm: sequencing in order of non-increasing Wj/pj ratios produces an op- 
timum schedule m- We consider here the case of unrelated machines with no 
release dates. In this section, by pj ' we denote in this section the execution time 
of job j on machine i and for simplicity of notation we think in terms of two 
machines; however, it should be clear that it can be generalized for any constant 
number of machines. If we consider the restriction of the optimum schedule to 
each one of the machines, then Smith’s ratio rule also applies: the jobs executed 
on machine Mi are processed in order of non-increasing ratios = Wj/p^^\ 
It is therefore clear that the only difficulty is to decide, for each job, on which 
machine to execute it. To this end, we define a ratio partitioning, i.e. a partition 
of the range of ratios into disjoint windows such that the i*^ window consists 
of ratios in the range Fi = {a^Rmax,a^~^Rmax], where Rmax = 
and a = 6“^ . 

A first observation is that we can think of each job having its two ratios either 
in the same window or in adjacent windows. If the processing times of a job are 
very different on the two machines, then we can directly decide to schedule it on 
the machine on which it has the shorter processing time. In quantitative terms: 
We can assume that all jobs such that < epj^^ are scheduled on Mi (we say 

that this job is Mi -decided), and that all jobs such that pj^^ < ep^p are scheduled 
on M 2 (we say that this job is M 2 -decided). To prove, we show that by moving 
all such jobs to the machine with the small processing time, the cost does not 
increase much: we move each such job in the other machine in the place so that 
it starts before or at time Cj (its completion time in the old schedule) . Thus the 
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new completion time of this job is at most Cj + < Cj + tp^p < Cj{l + e) 

The other jobs affected are the jobs in machine Mi (suppose we moved the job 
to Ml) which are delayed. Each such job k is delayed by at most e times the 
sum of M 2 -processing times of moved jobs which completed before time Ck in 
M 2 ; but this is e times Ck- Thus, in this case, we can put p^ = 00 or p^^ = 00 
accordingly. Moreover, for each undecided job p^ < ep^ and p^ < ep^ , that 

is e < R-P /E-P < 1/e. Hence, each job is either decided or it has its two ratios 
either in the same window or in adjacent windows. 

Towards to a further problem simplification we define a job- type as the 

set of jobs having the same pair of ratios, i.e. Tr^r' = {/ | Rp'' = R and rP = 
R'}. Using this definition, a second observation is that “very different” job types 
do not affect each other. To state formally, we rewrite the cost function as: 
Ei,fc on Ml -^ib/ k] + on M 2 -^2b/ k], where 
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Notice that an entry M.i\j,k] represents the contribution to the cost function 
of job j, due to job k. Thus, if job j is scheduled on machine Mi, and the 
jobs on Ml with ratio equal to Rp^ are ordered randomly, then the expected 
weighted completion time of j satisfies: wjCj = qjj -^ib/ ^]- Using this 
formulation of the cost function, we can prove that if the ratios of two jobs are 
very different, then we can neglect their interaction: 

1. For every pair of jobs {j, k} such that R^ < we can replace Mi[j, k] 

by 0. 

2. For z = 1, 2, and for every pair of jobs {j, k} such that j and k are either Mi- 



decided or undecided, and such that rP < we can replace both Mi[j, k] 

and M 2 [j, k] by 0. 

We prove the first in two steps: First for each k on Mi, we look at the jobs j 
on Ml such that rP < e^Rp\ and whose completion times satisfy Ck < eCj. 
The presence of k before such a j has only a marginal effect on j’s completion 
time (easy to prove). Second, we look at the pairs {j, k} which are scheduled by 
on Ml, such that rP < e'^RP, but whose completion times satisfy Cj < Cfc/e. 
The contribution to the cost function of those pairs is very little. 

To prove the second: Consider the case z = 1. From item (1) above we can 
replace M.i\j, k] by 0. If either job is Mi-decided, then it will never be placed on 
M 2 and so we can replace M. 2 [j,k] by 0. If both are undecided, then we have: 
Pj > epj and p^ < p^. /e. iiius. ^ < --^_e 

By item I above, again, we can replace A42[j, k] by 0. 

Finally, we can bound the number of jobs within each job-type by a constant 
number. Consider a job-type Tr^ri and the sum of the weights of jobs in Tr^ri, 
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We group small jobs together to build larger compound jobs. Thus, we 
can assume that every job of a job-type has weight at least c^Wr^ri, and hence 
each job-type has at most 1/e^ = 0(1) elements. More specifically, we group 
greedily so that the sum of weights of the tiny jobs in each group (which now 
represents the weight of the compound job) is between c^Wr^ and 2e^WR^R'. 
Clearly a schedule that uses the compound jobs results in a cruder fractioning 
of the jobs in the job type among the machines. To take care of that we stretch 
time by 1 -|- e and this stretching is enough to accommodate the compound jobs. 
This is because, with 1 -|- 0{e) loss, we can discard schedules that put a fraction 
of a job type smaller than eWR^Ri on one of the machines. 

3.1 Example: R2\\'^WjCj 

We first simplify the input of the problem by applying geometric rounding on the 
parameters defining the jobs. Then, we identify the jobs which are Mi-decided 
or M 2 -decided. We replace all irrelevant processing times (i.e. if j is Mi~ 

decided and if j is M 2 -decided) by oo, which means that the corresponding 
ratio becomes equal to 0. Then we do all the simplifications of the previous 
subsection, namely calculating first the A4i[j,k] and then putting accordingly 
some of them equal to 0. 

Note that, the set of undecided jobs having at least one ratio in a specific 
window has size 0(log^(l/e)/e^): Roughly, because of geometric rounding, we 
have only a constant number of different ratios within a window and, because 
undecided jobs have their ratios either in the same window or in adjacent win- 
dows, there are only a constant number of job types that may contain a job of 
a particular ratio in one of the machines. Moreover, there is a constant number 
of jobs within each job type. 

Summing up the above analysis: a) Each job type has the two ratios either in 
the same window or in adjacent windows and b) Job types affect each other only 
in the stretch of two windows and c) there is a constant number of undecided 
jobs with at least one ratio in a specific window. An immediate consequence of 
that is dynamic programming with respect to the windows of ratios. In each 
stage i of the dynamic programming, we have fixed the schedule for jobs with 
both ratios in windows Fi, ... Fi and keep all possible schedules of jobs with at 
least one ratio in window T). Thus, each stage runs in 0(1) time. 

4 Time Partitioning 

For an arbitrary integer x, we define Rx = (l-l-e)“. We partition the time interval 
(0, oo) into disjoint intervals of the form Ix = [Rx,Rx+i)- We will use Ix to refer 
to both the interval and the size {Rx+i — Rx) of the interval. We will often use 
the fact that Ix = eRx, i-e., the length of an interval is e times its start time. We 
consider an optimum schedule and focus on the schedule of one machine. Moving 
a job j inside the interval in which it runs produces a 1 -I- 0(e) loss because it 
will only increase its completion time by at most R which is less than eRx+i 
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or less than eCj. Another useful observation is that a job can not be arbitrarily 
large: In the worst case, a job starts at time epj and completes at time epj +pj, 
hence, it crosses at most s = [logi_|_g = |"logi_|_,,(l + ^)] intervals. 

If a job j crosses several intervals, we know that it crosses at most s intervals, 
and then we can accumulate idle time for all s intervals in the beginning of job j, 
and we know that the relative size of the 1st and sth interval is = 

e. Also know that by time stretching we can create, in each interval an extra idle 
gap of size elx- In what follows in this section, we assume unit weights. 



4.1 Large and Small Jobs 

In many cases, jobs that are much smaller than the interval in which they run 
are essentially negligible and easy to deal with. The difficulty comes from jobs 
that are large — taking up a substantial portion of the interval. Small jobs are a 
lot like fractional jobs in linear relaxation methods, and fractional solutions are 
usually easier to find. We assume, in this section, that small jobs are defined to 
be those with pj < elx, where Ix is the interval where job j runs. 

We restrict attention to schedules in which no small job crosses an interval: 
By time stretching, add in each interval a gap of elx and finish the single small 
job that may be crossing an interval. Also, note that there is a constant number 
of large jobs in an interval, namely at most -. 

Interestingly, we can almost get rid altogether of large jobs (except a constant 
number of them). We can do that by moving most large jobs in larger intervals 
without much degrading the quality of the schedule. Thus, we end up with an 
overall constant number of large jobs: 

Lemma 1. With o 1 + 0{e) loss, we assume: 

1. For each job j, Sj > mm{^,e’^OPT}. 

2. All jobs are small in the intervals they run except at most 1/e^. 

Proof. The proof of (2) follows from (1): Note that Sj > Pjje^ means that job j 
is small when it runs. Also, all jobs starting after time e^OPT are at most 1/e^. 
The small jobs trivially satisfy this condition in (1). We move around large jobs 
in larger intervals so that they are now small in the interval that they run. We 
do that only for large jobs that complete before e^OPT. 

We move a task that completes before time (1 + in the first gap after 
time t = Cjje^. We have to worry about three considerations: 

1. There is space large enough to accommodate all jobs that are landed on 
a particular gap. This is so because the gap at time t is equal to e^t, hence, at 
least equal to Cj, thus can hold all jobs which complete before time Cp 

2. There is not some hugely long crossing job in this interval. This is guar- 
anteed by the fact that any crossing job may cross at most s intervals. Thus, 
we put the moved job in the beginning of the crossing job and still it is in an 
interval where it is considered small. This is so because the intervals along which 
a job stretches differ in size by a factor at most e. 
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3. The added cost is not too much: We need to bound the added cost of 
the large jobs we moved forward. The last interval from which we advance jobs 
ends at time t = e^OPT. Thus the total completion time of the advanced jobs 



IS 

as desired. 



1 (l + e)i^a; 



— e5 Si>0 



Liktir. - 



= e ■ OPt(1 + e)^ < 2eOPT 



4.2 Example: 1| r’j | X) 

We apply geometrical rounding to all variables of the input of the problem. 
Thus, jobs are released only in the end of intervals. The preemptive variant 
of ISQ can be solved optimally by SPRT rule. Supposing that, in the 
optimum schedule of the non preemptive problem all jobs are small, we can set 
release dates equal to max{rj, ^}. According to Lemma ^ this does not change 
the feasibility of a nearly optimum schedule. We can now use SPRT rule to 
obtain an optimum preemptive schedule and then use the gaps in each interval 
to finish up the preempted jobs. These are small, so they need no more than elx 
to finish up. Running SPRT followed by finishing up preempted jobs corresponds 
to running SPT rule. 

Our only problem, now, is that the optimum schedule may require some 
jobs to run when they are large. By Lemma ^ there is still a nearly optimum 
solution with only a constant number of large jobs. We can schedule these large 
jobs by complete enumeration and schedule the rest using SPT rule with the 
new release dates as above. Again, using SPT rule for the small jobs, essentially 
we use SPRT rule (which still remains optimum in the presence of the fixed large 
jobs) to obtain a preemptive solution and then finish up the jobs. 



4.3 Example: P2|/ia;j| ^ Cj 

For this problem, we have two kinds of monoprocessor jobs, the 1-jobs and the 
2-jobs dedicated to run on machine M\ and M 2 respectively. Biprocessor 12-jobs 
are dedicated to both machines. 

In what follows, we use the following proved in |Sj: In any optimum schedule, 
1) All jobs of the same type are scheduled according to the shortest processing 
time rule (SPT). 2) If a biprocessor job is not the first job of the schedule, then 
its starting time must be equal to the completion time of some other job. 

Besides geometric rounding, we need some more transformations. We begin 
with the following lemma which states that jobs scheduled to run in parallel 
have comparable processing times. 

Lemma 2. With o l-|-e loss, we never have two jobs scheduled in parallel whose 
processing times differ by a factor greater than ^ . 

Proof. Let j be a job executed w.l.o.g. on processor Mi, and let A be the set of 
jobs scheduled in parallel with j on processor M 2 with processing times < -\pj. 
Recall that, since the single-machine-jobs on each processor are scheduled in the 
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order of increasing processing times, the jobs in A form an interval. We consider 
the following two cases: 

a) J2i^APi ^ ^Pj- time stretching to move starting time of j to 

Sj + epj, thus jobs in A and job j do not run in parallel any more. b)J2i^APi — 
epj. Put the job j in the gap so that it completes at time Let Sj be the 
starting time of job j in the initial schedule. Since for every i € A, pi < e^pj, we 
get that at least ^ jobs in A complete in the time-interval [Sj + ^ , Sj + epj]. 

ThenXlieAC'* > + > l^i^j+Pj) = 4 ^ C^. Thus, §- <4:eJ2^eAC^■ 

Summing over all jobs j, we get an increase of at most 4eOPT. 

Moreover, it can be easily proven that if the set of jobs T can be partitioned 
into subsets L and S such that maxpj < e® minp^-, then, all jobs in S are sched- 

j&S j&L 

uled before any job in L is scheduled. A separation result allows for partitioning 
the jobs into ’’long” jobs and ’’short” jobs with a few (negligible) ’’medium” jobs 
in between. Its proof is a simple algebraic manipulation. 

Lemma 3. Let Lq = {j G 'T\pj > e^OPT}, and Li = {j G T\e^^e^OPT < pj < 
e®(®“^)e^OPT}. If Lq ^ 0, then there is some i < [log^^^ such that 

OPT{Lo U Li U . . . U Li) < (1 -h e^)OPT{Lo U Li U . . . U L,_i). 

Li is the negligible set of ’’medium” jobs, Lq U . . . U Li_i are the ’’long” jobs 
and the rest are the ’’short” jobs. It is not hard to prove that, with a I -I- 0(e) 
loss, we can schedule all short jobs before any long or medium job is scheduled. 
Clearly, we can find such a partition in polynomial time. 

The above transformations together with a polynomial algorithm for the pre- 
emptive variant |3 yield a PTAS for this problem. For the short jobs, according 
to the results in the beginning of this section, we can rearrange biprocessor jobs 
within an interval, so that to obtain only one preemption per interval. Then, we 
move large monoprocessor jobs forward in intervals where they are small and fin- 
ish up preempted (monoprocessor) jobs to derive a non-preempted schedule. For 
the long jobs we find a schedule by complete enumeration and then concatenate 
the two schedules. 
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Abstract. The Maximum Weight Stable Set (MWS) Problem is one of 
the fundamental algorithmic problems in graphs. It is NP-complete in 
general, and it has polynomial time solutions on many particular graph 
classes, some of them defined by forbidden subgraphs. A classical exam- 
ple is the result of Minty that the MWS problem is polynomially solvable 
for the class of claw-free graphs. The complexity of the MWS problem is 
unknown for the class of Ps-free graphs. 

We give a survey on recently obtained efficient algorithms for the 
MWS problem for several graph classes defined by forbidden subgraphs 
where the algorithm avoids to recognize whether the (arbitrary) input 
graph is in the class i.e. the output is either a correct solution of 
the MWS problem or the fact that the input graph is not in the 
class. Such algorithms were called robust by Spinrad. The algorithms 
use the concepts of modular decomposition and of clique width of graphs. 

Keywords: Maximum Weight Stable Set Problem on graphs; robust al- 
gorithms; modules and homogeneous sets in graphs; prime graphs; graph 
structure; clique width. 



1 Introduction 

A vertex set in a finite undirected graph G = (V,E) is stable if its elements are 
pairwise nonadjacent. For a vertex weight function w on V, let aw(G) denote the 
maximum weight sum of a stable set in G. The Maximum ( Weight) Stable Set 
{M{ W)S) Problem asks for a maximum (weight) stable set in the input graph G. 

The M(W)S problem is a basic algorithmic graph problem occuring in many 
models in Computer Science and Operations Research. It is NP-complete even 
for triangle-free graphs [46] (see [33]) which led to the investigation of a variety 
of graph classes defined by forbidding small graphs such as claw-free graphs 
for which a polynomial time algorithm for the MWS problem was given by 
Minty [43] (and independently for the MS problem by Sbihi [47]). Note that for 
(A'i^ 4 ,diamond)-free graphs [16] the MS problem is NP-complete. 

Minty’s algorithm for claw-free graphs has been extended to a polynomial 
time algorithm for the MWS problem on chair-free graphs by Alekseev in [1]. 

The complexity of the MWS problem for Ps-free graphs is still unknown 
which recently led to the investigation of some subclasses of Ps-free graphs where 
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polynomial time algorithms for the MWS problem were given - see e.g. [2, 4, 6, 7, 
11,14,15,29,31]. 

Summarizing the known complexity results for P 4 extensions (see Figure 1) 
gives: The MWS problem for F-free graphs 

1. can be solved in polynomial time if F G {co-gem, chair} 

2. is NP-complete if F G { co-P, bull, co-chair, house, gem}, and 

3. and its complexity is open for F G {F5, C5, P} 

The aim of this note is to give a survey on an approach combining the con- 
cepts of 

(i) modular decomposition, 

(ii) clique width and 
(in) robust algorithms 

which recently attracted much attention. 

To (i): The modular decomposition tree of a finite undirected graph G gives 
a tree description of G where the leaves are the vertices of G and there are three 
kinds of internal nodes: join (denoted by (1)), co-join (denoted by (0)) and 
prime nodes. The papers [41,42,23,24] describe this approach and give linear 
time algorithms for computing the modular decomposition tree. 

It is a simple observation that the MWS problem is solvable in polynomial 
time for a graph class whenever it is solvable in polynomial time for the prime 
graphs belonging to this class since aw{Gi{l)G 2 ) = raax{aw{Gi) , aw{G 2 )) , and 
oiw{Gi{0)G2) = ciw{Gi) a^{G2)- 

This immediately leads to polynomial time solutions for the MWS problem 
using the modular decomposition tree in a bottom-up way for computing aw{G). 

To (ii): Bounded clique width allows to efficiently solve all algorithmic 
problems expressible in a certain kind of Monadic Second Order Logic in linear 
time [21], among them the MWS Problem. The notion of clique width has 
been introduced in [20] and is intimately related to modular decomposition of a 
graph. We give a list of graph classes having bounded clique width. 

To (Hi ): In [48], Spinrad defined the concept of a robust algorithm for a graph 
class C and an algorithmic problem as follows: 

— If G G C then the algorithm solves the problem correctly. 

— If G ^ C then the algorithm either solves the problem correctly or finds out 

that G ^ C (sometimes with a corresponding witness for this fact) . 

Thus, a robust algorithm delivers a correct answer in any case, solves the 
problem for every graph in C and avoids to recognize C. This is of crucial im- 
portance whenever the recognition time bound for C is worse than solving the 
algorithmic problem on the graph class (in extreme cases, the recognition prob- 
lem can be NP-complete and the algorithmic problem can have a linear time 
solution) . 
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We give examples of efficient robust algorithms for the MWS problem on 
some graph classes such as (bull,chair)-free graphs and (P 5 ,co-chair)-free graphs. 

Our results improve previous ones on particular graph classes in several ways: 

1. for some classes, the structure investigations imply bounded clique width for 
the whole class or bounded clique width under suitable assumptions which 
are useful for solving the MWS problem; 

2. for some examples, previous time bounds can be improved due to the fact 
that robust algorithms avoid to recognize whether the input graph is in the 
class; 

3. for bounded clique width, not only the MWS problem but also many other 
problems such as Minimum Dominating Set, Steiner Tree, Maximum Weight 
Clique can be solved efficiently. 

2 Notions and Preliminary Resnlts 

Throughout this paper, let G = (V, E) be a finite undirected graph without self- 
loops and multiple edges and let \V\ = n, \E\ = m. Let G = {V, E) with xy £ E 
if and only xy ^ E for x,y £ V denote the complement graph of G. 

The edges between two disjoint vertex sets X^Y form a join (co-join) if for 
all pairs x £ X, y £Y, xy £ E (xy ^ E) holds. Let A(1)B (A(O)i?) denote the 
corresponding join (co-join) operation between A and B. 

A vertex z £V distinguishes vertices x,y £ V \i zx £ E and zy ^ E. X 
vertex set M C M is a module if no vertex from V\M distinguishes two vertices 
from M i.e. every vertex v £ V\M has either a join or a co-join to M. A module 
is trivial if it is either the empty set, a one- vertex set or the entire vertex set V. 
Nontrivial modules are called homogeneous sets. A graph is prime if it contains 
only trivial modules. The notion of modules plays a crucial role in the modular 
(or substitution) decomposition of graphs (and other discrete structures) which 
is of basic importance for the design of efficient algorithms - see e.g. [44] for 
modular decomposition of discrete structures and its algorithmic use. 

A homogeneous set M is maximal if no other homogeneous set properly 
contains M. It is well-known that in a connected graph G with connected com- 
plement G, the maximal homogeneous sets are pairwise disjoint which means 
that every vertex is contained in at most one maximal homogeneous set. The 
existence and uniqueness of the modular decomposition tree is based on this prop- 
erty, and recently, linear time algorithms were designed to determine this tree - 
see [41,42,23,24]. The tree contains the vertices of the graph as its leaves, and the 
internal nodes are of three types: they represent a join or co-join operation, or a 
prime subgraph. The graph G* obtained from G by contracting every maximal 
homogeneous set to a single vertex is called the characteristic graph of G. It is 
not hard to see that G* is connected and prime. 

Let N(v) := {u : u £ V,u ^ v,uv £ E} denote the open neighborhood of 
V and iV[w] := N(v) U {?;} the closed neighborhood of v. For U C V let G(U) 
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denote the subgraph of G induced by U. As already mentioned, a vertex set 
U C V is stable (sometimes called independent) in G if the vertices in U are 
pairwise nonadjacent. A vertex set C/ C is a clique in G if C/ is a stable set in 
G. 

For a vertex weight function w on V, let a^, (G) denote the maximum weight 
sum of a stable set in G and let lOw(G) := a^iG) denote the maximum weight 
of a clique in G. If w{v) = 1 for all vertices v then we omit the index w. Note 
that for all classes on which the MWS problem can be efficiently solved, the 
Maximum Weight Clique Problem can be efficiently solved on the complement 
class. 

For k > 1, let Pfe denote an induced chordless path with k vertices and k — 1 
edges, and for k > 3, let Gk denote an induced chordless cycle with k vertices 
and k edges. A hole is a Gk, k > 5. For a P 4 with vertices a,b,c,d and edges 
ab,bc,cd, the vertices a and d (6 and c) are called the endpoints (midpoints) of 
the P 4 . Note that the P 4 is the smallest nontrivial prime graph and the com- 
plement of a P 4 is a P 4 itself (where midpoints and endpoints change their roles) . 



( 1 ) co-gem 



(2) P5 



(3) chair (fork) 



(4) co-P (co-banner) (5) P (banner) ( 6 ) Cg 




(7) bull ( 8 ) kite (co-chair, co-fork) (9) house (C 0 -P 5 ) 




( 10 ) gem 




Fig. 1. All one- vertex extensions of a P4 



See Figure 1 for the definition of the bull, chair, P, gem and their comple- 
ments, the bull, co-chair, co-P and co-gem. 

The diamond is the K 4 — e i.e. a four vertex clique minus one edge. The claw 
is the graph consisting of four vertices a, b, c, d such that a (the center of the 
claw) is adjacent to the pairwise nonadjacent vertices b,c,d. The domino is a 
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graph consisting of a P 5 plus a vertex adjacent to the central vertex and to the 
end- vertices of the P 5 . An A is the graph resulting from a domino by deleting 
one of its edges between two vertices of degree two. 

Let denote a set of graphs. A graph G is T -jree if none of its induced 
subgraphs is in T . There are many papers on the structure and algorithmic use 
of prime .7^-free graphs for T being a set of P 4 extensions; see e.g. [35,36,28,29, 
30,31,38,4,6,11]. 

A graph G is a 

— thin spider if G is partitionable into a clique C and a stable set S with 
|G| = [S'] or |G| = [S'] -I- 1 such that the edges between C and S are a 
matching and at most one vertex in G is not covered by the matching (an 
unmatched vertex is called the head of the spider); 

— thick spider if it is the complement of a thin spider; 

— matched co-bipartite graph if G is partitionable into two cliques Gi, G 2 with 
[C'll = [<^21 or |Gi| = IG 2 I -b 1 such that the edges between Gi and G 2 are 
a matching and at most one vertex in Gi and G 2 is not covered by the 
matching; 

— co-matched bipartite graph if it is the complement of a matched co-bipartite 
graph; 

— bipartite chain graph if it is a bipartite graph B = (AT, Y, E) and for all 
vertices from X (T), their neighborhoods in Y (Al) are linearly ordered (bi- 
partite chain graphs appear in [50]; in [40] they are called difference graphs); 

— co-bipartite chain graph if it is the complement of a bipartite chain graph. 

— enhanced co-bipartite chain graph if it is partitionable into a co-bipartite 
chain graph with cliques Gi, G 2 and three additional vertices a, 6 , c (a and c 
optional) such that N{a) = Gi U G 2 , N{b) = Gi, and N{c) = G 2 . 

— enhanced bipartite chain graph if it is the complement of an enhanced co- 
bipartite chain graph. 

There are some basic tools extending small subgraphs in prime graphs which 
are of great use in many cases when investigating small classes of prime graphs. 

Lemma 1 ([37]). If a prime graph contains an induced C 4 then it contains an 
induced house or A or domino. 

The graph complement version of Lemma 1 is 

Corollary 1. If a prime graph contains an induced 2 K 2 then it contains an 
induced P 5 or A or co-domino. 

Lemma 2 ([45]). If a prime graph contains a triangle G 3 then it contains an 
induced house or bull or double-gem. 

The graph complement version of Lemma 2 is 
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Corollary 2. If a prime graph contains a set of three pairwise nonadjacent ver- 
tices then it contains an induced P 5 or bull or A. 

Note that unlike the case of three pairwise nonadjacent vertices, the case of 
four pairwise nonadjacent vertices leads to infinitely many prime extensions as 
shown in [10]. Based on the reducing copath method of [51], [9] describes also 
the extensions of a claw, a diamond and a P 3 + Ki which lead to finitely many 
extension graphs. 

Hoang defined in [35] that a graph is Pj^-sparse if no set of five vertices in G 
induces at least two distinct Pj’s. From the definition of P 4 -sparse graphs, it is 
obvious that a graph is P^-sparse if and only if it contains no C 5 , P 5 , P 5 , P, P, 
chair, co-chair (see Figure 1). The motivation for P 4 -sparse graphs and variants 
was given by 

— applications in areas such as scheduling, clustering and computational se- 
mantics; 

— natural generalizations of cographs. 

Theorem 1 ([35]). The prime P^-sparse graphs are the spiders. 

Spiders were called turtles in [35]. 

The class of P 4 -sparse graphs was extended in several ways leading to simple 
prime graphs - see e.g. [3,29,32]. 



3 Cographs, Clique Width, and Algorithmic Problems 

The P 4 -free graphs (also called cographs) play a fundamental role for graph 
decomposition; see [13] for a survey on this graph class and related ones. For 
a cograph G, either G or its complement is disconnected, and the cotree of G 
expresses how the graph can be recursively generated from single vertices by re- 
peatedly applying join and co-join operations. Thus, the modular decomposition 
tree of cographs contains no prime nodes. See [19] for linear time recognition of 
cographs and [17,18,19,13] for more informations on P^-iree graphs. 

In [20], the join and co-join operations were generalized to the following 
operations in labeled graphs: 

— creation of single vertices with integer label i, 

— disjoint union (i.e. co-join), 

— join between all vertices with label i and all vertices with label j for i ^ j, 
and 

— relabeling vertices of label i by label j, 

[20] then defines the clique width cwd{G) of a graph G as the minimum 
number of labels which are necessary to generate a given graph by using the 
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three operations. Obviously, the clique width of cographs is at most two. A k- 
expression for a graph G of clique width k describes the recursive generation of G 
by repeatedly applying the three operations using only a set at most k different 
labels. 



Proposition 1 ([21,22]). The clique width of a graph G is the maximum of the 
clique width of its prime graphs, and the clique width of its complement graph G 
is at most twice the clique width of G. 

Recently, the concept of clique width of a graph attracted much attention 
since it gives a unified approach to the efficient solution of many algorithmic 
graph problems on graph classes of bounded clique width via the expressibility 
of the problems in terms of logical expressions; in [21], it is shown that every 
algorithmic problem expressible in a certain kind of Monadic Second Order Logic 
called LinEMSOL(Ti^L) in [21], is linear-time solvable on any graph class with 
bounded clique width for which a fc-expression can be constructed in linear time. 

Hereby, in [21] it is mentioned that, roughly speaking, MSOL(n) is Monadic 
Second Order Logic with quantification over subsets of vertices but not of edges; 
MSOL(ri_i) is the extension of MSOL(ti) with the addition of labels added 
to the vertices. LinEMSOL(ri /,) is the extension of MSOL(Ti.i) which allows 
to search for sets of vertices which are optimal with respect to some linear 
evaluation functions. The Maximum Weight Stable Set Problem is an example 
of a LinEMSOL(ri,i) problem. 



Theorem 2 ([21]). Let C he a class of graphs of clique width at most k such 
that there is an 0{f{\E\), |P|) time algorithm, which for each graph G in C, 
constructs a k-expression defining it. Then for every LinEMSOL(Ti^L) problem 
on C, there is an algorithm solving this problem in time 0{f{\E\), |P|). 

Note that the clique width of distance hereditary graphs (which are exactly 
the (house, hole, domino, gem)-free graphs; see [13] for different characteri- 
zations of these graphs) is at most 3 as shown in [34], and thin spiders and 
bipartite chain graphs are (house, hole, domino, gem)-free. Thus, their clique 
width is at most 3. Moreover, for thick spiders, co-bipartite chain graphs, 
matched co-bipartite graphs, co-matched bipartite graphs, enhanced bipartite 
and co-bipartite chain graphs, induced paths and cycles of arbitrary length as 
well as for graphs having a fixed number of vertices, it can be seen by using 
Proposition 1 or in a straightforward way by giving a fc-expression that their 
clique width is bounded by a constant. 

Recall that Theorem 1 says that the prime P 4 -sparse graphs are the spiders, 
and according to Proposition 1 and the fact that the clique width of spiders is 
bounded by 3, it follows that Rj-sparse graphs have bounded clique width. In a 
similar way, other examples lead to bounded clique width - see the next section. 
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4 Some Classes of Bounded Clique Width 

We first mention complete descriptions of prime (P 5 ,P 5 ,buU)-free, {P^,P^,co- 
chair)-free and of (bull, chair, co-chair)-free graphs: 

Theorem 3 ([28]). If G is a prime (P^,P^,hull)-free graph then G or G is a 
C 5 or a bipartite chain graph. 

Theorem 4 ([29]). IfG is a prime (P5,P5,co-chair)-free graph then G is either 
a bipartite chain graph or a spider or C 5 . 

Theorem 4 has a slightly simpler proof in [15]. 

Theorem 5 {[8]). If G is a prime (bull, chair, co- chair) -free graph then G or 
G is co-matched bipartite, a path, or a cycle. 

Our next example gives a complete description of (P 5 ,diamond)-free graphs: 

Theorem 6 ([4]). If a connected and co-connected graph G is (P5,diamond)~ 
free then the following two conditions hold: 

(1) The homogeneous sets of G are P^-free; 

(2) For the characteristic graph G* of G, one of the following conditions hold: 

(2.1) G* is a matched co-bipartite graph; 

(2.2) G* is a thin spider; 

(2.3) G* is an enhanced bipartite chain graph; 

(2.4) G* has at most 9 vertices. 

Theorem 6 turns out to be useful in some other cases since other classes 
of prime graphs are contained in the class of (P 5 ,diamond)-free graphs as the 
subsequent theorems show. 

Theorem 7 ([15,8]). Let G be a prime graph. 

(i) If G is (P^, CO -chair, gem) -free then G is (P^, diamond) -free. 

(ii) If G is (P5 ,bull,co-chair)-free then G is either the complement of an induced 
path or cycle or (P^, diamond) -free. 

Theorem 8 ([5]). Let G be a prime graph. 

(i) If G is (P5,P5, gem) -free then G is a distance-hereditary graph or a C^. 

(ii) IfG is (P5,gem,co-gem)-free ((P5, bull, gem) -free, (P5, chair, gem)-free) then 
G is (P^, diamond) -free. 

(Hi) IfG is (P5,P, gem) -free then G is a matched co-bipartite graph or a gem-free 
split graph (being distance hereditary ) or G5. 

Now, Proposition 1 and the fact that all particular graph occuring in the 
theorems such as bipartite chain graphs, spiders etc. have bounded clique width, 
implies: 
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Corollary 3. The graph classes described in Theorems 3-8 have bounded clique 
width. 

Meanwhile, it was shown in [12] that the class of (chair, co-P,gem)-free 
graphs has bounded clique width. 

It is interesting to study other combinations of forbidden P 4 extensions. In 
some of the cases, it turns out that the class has unbounded clique width since 
it contains the class of bipartite graphs, split graphs or unit interval graphs or 
their complement class which classes have unbounded clique width as shown in 
[34]. In many cases, however, the question whether the clique width is bounded 
remains open - see [5] for a list of such problems. 

5 Robust MWS Algorithms 

Recall that for all classes of constant-bounded clique width for which k- 
expressions can be found in linear time, the MWS problem can be solved in 
linear time. This holds true for all examples in the previous section but does 
not automatically mean a linear time robust algorithm since the recognition 
time bounds are not necessarily linear. 

For (P 5 ,diamond)-free graphs however, there is a slightly larger class C having 
linear time recognition and the same clique width which implies that there is a 
robust linear time MWS algorithm for class C and thus for (P 5 ,diamond)-free 
graphs [4]: take C = the class of graphs whose homogeneous sets are cographs 
and whose characteristic graph is either matched co-bipartite or a thin spider or 
an enhanced bipartite chain graph or have at most 9 vertices. 

This is also applicable for some subclasses of (P 5 ,diamond)-free graphs 
mentioned in Theorem 8 . 

Thus we get the following principle (without using the clique width ap- 
proach) which solves the MWS problem in linear time on (P 5 ,diamond)-free 
graphs in a robust way: 

Algorithm aw(G) for (P 5 ,diamond)-free graphs 

Input: A connected and co-connected graph G. 

Output: The maximum weight aw(G) of a stable set in G or a proof that 

G is not (P 5 ,diamond)-free (G is not in C). 

(1.1) Determine the maximal homogeneous sets Hi,... ,Hk of G and check 
whether they all induce cographs. If not, then STOP - G is not 
(P 5 ,diamond)-free (is not in C). 

(1.2) Otherwise, for alH G {1, . . . , fc} determine aw{Hi) using the cotree repre- 
sentations of Hi, .. . ,Hk. 
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(1.3) Construct the weighted characteristic graph G* by shrinking Hi to one 
vertex hi and giving hi the weight aw{Hi) for all i G {1, . . . , k}. 

(2) If G* is a matched co-bipartite graph, a thin spider, an enhanced bipartite 
chain graph or a graph with at most 9 vertices then solve the problem in 
a straightforward way; 

(3) If G* is none of these graphs then G is not (P 5 ,diamond)-free (G is not in 

C). 

Note that the Maximum Weight Clique Problem can be solved in a completely 
analogous way. 

Theorem 9 ([4]). Algorithm aw{G) for (P^, diamond) -free graphs is correct and 
works in linear time 0{n + m). 

In [12] it is shown that the simple structure of (chair ,co-P,gem)-free 
graphs (leading to bounded clique width) enables a similar approach as for 
(P 5 ,diamond)-free graphs. In this way, [12] improves the time bound G(n^) for 
the MWS problem given in [39,52] to a robust linear time algorithm. 

Now to other examples where we do not get linear time bounds but use some 
structural results showing that under additional conditions which help to solve 
the MWS problem, the clique width of some classes is bounded. 

Theorem 10 ([6]). Let G be a prime graph with stability number a{G) > 3. 

(i) If G is (P^, claw) -free then G is either a thin spider or a 3-sun. 

(ii) If G is (P^, bull) -free then G is a bipartite chain graph. 

(Hi) If G is (bull, claw) -free then G is an induced Pk, k>5, or an induced Ck, 
k>6. 

Theorem 10, (i) improves a result of [7]. Moreover, Theorem 3 is a corollary 
of Theorem 10, (ii). 

Theorem 10 leads to an obvious robust algorithm for the MWS problem 
on (P 5 ,claw)-free, (P 5 ,bull)-free and (bull,claw)-free graphs with time bound 
0{MM) by using the structural result given in Theorem 10 in the case that 
a(G) > 3 and computing aw{G) in a direct way if a{G) < 2. 

In [26] , De Simone and Sassano solved the MS problem for bull- and chair-free 
graphs in time 0(nA). Moreover, De Simone showed 

Lemma 3 ([25]). If a prime (bull,chair)-free graph contains an induced co- 
diamond then it is bipartite or an induced odd cycle G 2 k+it k >2. 

The subsequent result improves the results by De Simone and Sassano [26, 
25] in several ways: 

Theorem 11 ([8]). If G is a prime (bull, chair) -free graph with a{G) > 4 or 
containing a co-diamond then G is co-matched bipartite or an induced path or 
cycle. 
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This leads to an 0{nm) time bounded robust algorithm for the MWS 
problem in [8] for (bull,chair)-free graphs: If G is co-gem-free then the problem 
can be solved in a straightforward way in time 0(nm), and if G contains a 
co-gem i.e. G contains a co-diamond, Theorem 11 is applicable. 

Note that the clique width of (bull,chair)-free graphs is unbounded since 
every bipartite graph is (co-P,C 5 , bull, co-chair, house, gem)-free, and in [34] it 
was shown that an n x n square grid has clique width n -I- 1 which means that 
bipartite graphs and thus also co-bipartite graphs have unbounded clique width. 
Thus, the class of (P,C 5 ,bull,chair,P 5 ,co-gem)-free graphs (being a subclass of 
the (bull,chair)-free graphs) has unbounded clique width. 

Our last examples are based on the fact that for 2A'2-free graphs, the MWS 
problem can be solved in polynomial time due to a result of Farber [27] showing 
that 2 K 2 ~iree graphs have 0{n?) maximal stable sets, and by enumerating these 
maximal stable sets using the algorithm given in [49], an 0{n^) time bounded 
algorithm for the MWS problem is obtained. By using some additional tricks, 
the time bounds can be improved. 

Theorem 12 ([15]). If G is a prime (P^,co-chair)-free graph containing an 
induced 2 K 2 then G is matched co-bipartite. 

The class of (Ps,co-chair)-free graphs has unbounded clique width due to a 
result of [34] showing that unit interval graphs (which are the claw-free interval 
graphs) have unbounded clique width. 

Note that prime (Ps,co-P)-free graphs are even 2K2~iree which implies that 
the MWS problem is solvable in time O(n^) on these graphs. 

Theorem 13 ([11]). IfG is a prime (P^,gem)-free graph containing an induced 
2 K 2 then G is matched co-bipartite. 

We do not know whether the class of (Ps,gem)-free graphs has unbounded 
clique width. 

Robust algorithms for the MWS problem have the following time bounds: 

1. (Ps,co-chair)-free graphs: 0{nm) time [15]; 

2. (P 5 ,gem)-free graphs: 0{n^) time [11]; 

Note that in [14], a robust 0{nm) time algorithm for the (unweighted) MS 
problem on (Ps,P)-free graphs is given. 
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Optics is a key technology in communication networks to enable bandwidth 
intensive applications, such as video conferencing, real-time medical imaging, 
broadcasting services to home [S|. Readers interested in the relevant aspects of 
fiber optic technology are referred to mm- 

In WDM (Wavelength Division Multiplexing) optical networks, the available 
optical fiber bandwidth is utilised by partitioning it into several channels, each 
at a different wavelength. Each wavelength can carry a separate stream of data. 
In general, a WDM network consists of routing nodes (equipped with switches 
operating directly in the optical domain), interconnected by point-to-point optic 
fiber links. Each fiber-link can support a given number of wavelengths. 

A basic mechanism of communication in a wavelength routed network is 
a lightpath. A lightpath is an all-optical communication channel between two 
nodes in the network and it may span more that one fiber link. The interme- 
diate nodes in the fiber path route the lightpath directly in the optical domain 
using their switches. Data transmission through a lightpath does not require elec- 
tronic processing at the intermediate nodes, thus reducing delay and improving 
reliability. In the absence of any wavelength conversion device, the lightpath is 
required to be on the same wavelength channel through its path in the network. 
A fundamental requirement in a wavelength routed WDM network is that two or 
more lightpaths traversing the same fiber link must be on different wavelength 
channels so that they do not interfere with one another. 

Given a communication request from node x to node y, one sets up a lightpath 
for it by choosing a path from a: to y in the network and assigning a wavelength 
to this path. Similarly, given a set of communication requests, one has to set up a 
corresponding set of lightpaths under the basic constrain recalled above that two 
lightpaths traversing the same fiber link must be assigned different wavelengths. 
There is a vast literature dealing with the problem of minimizing the number 
of wavelengths to set up lightpaths for (classes of) communication requests (see 
e.g., 1 1 12141/1 1 ,411)1 1 41 lT)j V A summary of graph theoretical problems associated 
with routing in optical networks can be found in PITTH . 

Multicasting is the simultaneous transmission of data from a source to a sub- 
set of all possible destinations in the network. Multicast service is becoming 
a key service in computer networks. Many applications such as shared withe- 
boards, distributed interactive simulation, and teleconferencing require efficient 
multicast, where a sender transmits data to a group of receivers in an efficient 
way M2|. 

* Research partially supported by the European Community under the RTN project: 
“Approximation and Randomized Algorithms in Communication networks 
(ARACNE)” 
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In this talk, we will present several algorithmic problems and results con- 
nected to the problem of multicasting in all-optical networks. 
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Abstract. In this paper we propose an advanced randomized coloring 
algorithm for the problem of balanced colorings of hypergraphs (dis- 
crepancy problem). It allows to use structural information about the 
hypergraph in the design of the random experiment. This yields color- 
ings having smaller discrepancy than those independently coloring the 
vertices. We also obtain more information about the coloring, or, con- 
versely, we may enforce the random coloring to have special properties. 
Due to the dependencies, these random colorings need fewer random bits 
to be constructed, and computing their discrepancy can be done faster. 
We apply our method to hypergraphs of d-dimensional boxes. Among 
others, we observe a factor 2^*^^ decrease in discrepancy and a reduction 
of the number of random bits needed by a factor of 2'^. 

Since the discrepancy problem is a particular rounding problem, our 
approach is a randomized rounding strategy for the corresponding 
ILP-relaxation that beats the usual randomized rounding. 

Keywords: randomized algorithms, hypergraph coloring, discrepancy, 
randomized rounding, integer linear programming. 



1 Introduction and Results 

1.1 The Discrepancy Problem and Integer Linear Programs 

In this paper we deal with a special kind of integer linear programs, namely 
those that model discrepancy problems. Roughly speaking, the combinatorial 
discrepancy problem is to partition the vertex set of a given hypergraph into two 
classes in a balanced manner, i.e., such that each hyperedge contains the same 
number of vertices in each of the two partition classes. To be precise: 

We call a pair TL = (X,S), where X is finite set and £ C 2^ , a hypergraph. 
The elements of X are called vertiees, those of £ hyperedges. A partition of X into 
two classes is usually represented by a coloring y : A — > (7 for some two-element 
set C. The partition then is formed by the color classes G C. It turns 
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out to be useful to select —1 and +1 as colors. For a coloring x : X — >• {—1,1} 
and a hyperedge E G £ then the expression 

X{E) ■■= x(x) 

x^E 

counts how many of the +l-vertices of E cannot be matched by —1-vertices. 
\x{E)\ is therefore a measure of how balanced the hyperedge E is colored by x- 
As it is our aim to color all hyperedges simultaneously in a balanced manner, 
we define the discrepancy of x with respect to H by 

disc(?^,x) := max|x(A)|. 

E^c 

The discrepancy problem originated from number theoretical problems (e. g. 
van der Waerden |vdW27) or Roth Em), but due to a wide range of ap- 
plications and connections it has received an increased attention by computer 
scientists and applied mathematicians during the last twenty years. 

Most notably is the connection to uniformly distributed sets and sequences 
which play a crucial role in numerical integration in higher dimensions (quasi- 
Monte Carlo methods) . This area is also called geometric discrepancy theory. Via 
the so-called “transference principle” , geometric and combinatorial discrepancies 
are connected with each other. An excellent reference on geometric discrepancies, 
their connection to combinatorial ones and applications is the book of Matousek 
[IMat99j . 

The notion of linear discrepancy of matrices describes how well a solution of 
a linear program can be rounded to an integer solution [lattice approximation 
problem). Due to work of Beck and Spencer p4S84j and Lovasz et al. jLSV86| . the 
linear discrepancy can be bounded (in a constructive manner) by combinatorial 
discrepancies. 

Further applications are found in computational geometry and the theory of 
communication complexity. For these and other applications of discrepancies in 
theoretical computer science we refer to the new book of Chazelle f( ;ha,00| . 

Discrepancy problems can be formulated as integer linear programs. Since we 
believe that our methods can be extended to this more general context, let us 
briefly sketch the connection: Let X = {1, . . . , n| =: [n] and £ = (Ai, . . . , Em}- 
Then the following integer linear program (here given as a 0, 1 ILP) solves the 
discrepancy problem for T-L: 
minimize 2A 
subject to 



H a;. 




i = 


. , m 


idEj 
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i = I,-- 
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idEj 
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The problem in using the linear relaxation of this ILP is that there always 
exists the trivial solution x = (xi, . . . ,Xn) = Therefore, a fruitful connec- 
tion between solutions of the [0, l]-relaxation and the original problem is not to 
be expected. 

On the other hand, randomized rounding strategies for this trivial solution 
yield random colorings and, vice versa, generators of random colorings can be 
interpreted as randomized rounding strategies. Thus both problems are strongly 
connected. It also turns out that the tools used and the difficulties occurring 
in both problems are very similar. Thus we think that the methods of this 
paper might have a broader application and are not restricted to the discrepancy 
problem. 

Note that when applying a randomized rounding strategy to the above ILP, 
we do not need to care about feasibility (as for most randomized rounding prob- 
lems). The reason is that any infeasibility inflicted by the rounding, i.e., any 
violation of the constraints, simply is a discrepancy. 



1.2 Algorithmic Aspects of Randomized Coloring and Randomized 
Rounding 

Discrepancy is an AP-hard problem. It is even AP-hard to decide whether a 
zero discrepancy coloring exists or not. Efficient algorithms finding an optimal 
coloring therefore are not to be expected. Indeed, very little is known about 
the algorithmic aspect of discrepancy. For some restrictions of the problem a 
nice solution exist, e. g. for hypergraphs having vertex degree at most t. Beck 
and Fiala gave a polynomial time algorithm leading to a coloring having 

discrepancy less than 2t. 

A common algorithmic approach for the general case (and in fact the only 
one known to us) are random colorings obtained by independently choosing a 
random color for each vertex. Via a Chernoff-bound analysis (see e.g. Alon and 
Spencer [ASnOj l this yields colorings having discrepancy 0(^n logm), where 
as above n shall always denote the number of vertices and m the number of 
hyperedges. More precisely, they show 

Theorem 1. A random coloring obtained by independently choosing a random 
color for each vertex has discrepancy 

disc('H,x) < x/2nln(4 m) 



with probability at least ^ . 

Note that this yields a randomized algorithm computing a coloring of the 
claimed discrepancy by repeatedly generating and testing a random coloring 
until the discrepancy guarantee of the theorem is satisfied. This algorithm has 
expected run-time 0(n{R + to)), where R is the complexity of generating a 
random bit. To get rid of the random aspect, several so-called derandomiza- 
tion techniques have been developed. We refer to |Sri()l| for a survey. Random 
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constructions show that (at least for suitable values of n and m) there are hy- 
pergraphs having discrepancy Q{^n\og ^). Thus this approach cannot be im- 
proved significantly in the general case. 

Via the transfer sketched in the previous subsection, all of the above holds 
for general rounding problems as well. In particular, no randomized rounding 
strategy for a linear problem of n variables and m constraints can guarantee a 
violation in the constraints of less than f?(y^nlog ^). 

A central problem with random colorings (random rounding) is therefore 
how to take into account the special structure of the hypergraph (the ILP). One 
way to deal with this is to use random colorings as above, but to tighten the 
analysis using the structural information. Limited dependencies of ‘bad’ events 
play a crucial role here. Two papers in this context are Schmidt et al. [SSS95] 
and Srinivasan jSHS!. 

A second approach is to use a different kind of random colorings, i.e., to 
design the random experiment in a way that it exploits the structure of the 
hypergraph. This is what we do in this paper. 



1.3 Our Results 

We analyze a way of generating random colorings not by independently coloring 
the vertices, but by enforcing some dependencies. Thus we are able to exploit 
structural information about the hypergraph. 

This proves to be effective in several ways. Firstly, it allows to generate 
random colorings having smaller discrepancy. Being a fairly general class of 
hypergraphs that have some common structure, we analyze hypergraphs of d- 
dimensional boxes. Our randomized colorings beat the ordinary random colorings 
in terms of discrepancy by a factor roughly 2“*/^. 

A second advantage is that we also obtain some more information about the 
random coloring. For example, we may prescribe that our colorings should be 
fair, that is, have equal-sized color classes. This can be useful in some applica- 
tions, e. g. the recursive method to construct balanced multi-colorings of fl )S()1] 
uses fair colorings. A nice thing from the technical point of view is that we get 
these fair colorings without extra technical difficulties. Usually, working with 
fair colorings is more difficult, since the hyper geometric distribution is harder to 
analyze than the binomial one (cf. Chvatal and Uhlmann CIEmI). 

A third point concerns the complexity of generating the colorings. Due to 
the dependencies the number of random bits needed to generate our random 
colorings is smaller than for ordinary random colorings. For the hypergraphs of 
d-dimensional boxes we reduce the number of random bits needed by a factor 
of 2‘^. This is important, if generating random bits is costly. 

Finally, computing the discrepancy of our random colorings can be done 
faster compared to ordinary random colorings. The reason is that (depending 
on the hypergraph, of course) the number of relevant hyperedges, i.e., those for 
which x(A) has to be computed, is reduced. Since a typical randomized algorithm 
computes a low-discrepancy coloring by repeatedly generating a random coloring 
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and then computing its discrepancy until a satisfactory solution is found, this 
fact also speeds up computing low-discrepancy colorings. 



2 Structured Randomized Coloring 

As mentioned in the introduction, our aim is to generate random colorings that 
do not independently color the vertices, but on the contrary use suitable depen- 
dencies that reflect the structure of the hypergraph. To do so, we partition the 
vertex set into classes. Within such a class, we will have perfect dependence, 
that is, each vertex determines the color of all other vertices in the class. For 
two vertices lying in different classes, their colors shall be chosen independently. 
The problem of this very general approach is of course to catch the structure 
of the hypergraph through the partition and the dependencies in the partition 
classes. We show an example of how to do so in the next section and proceed by 
fixing the general framework. 

Let V — {Pi,... ,Pr} be a partition of the vertex set. Let XPi ■ Pi 
{ — 1,4-1} be colorings such that |xp. (PTlPi)! < 1 holds for all edges E G £. For 
a hyperedge E G £ set 

I{V,E) :={iG[r]\xP,{EnP^^O}, 

hence yp. (PTlPi) G { — 1, +1} for all i G I{V, E). We generate a random coloring 
like this: For each i G [r] := (1, . . . ,r| we ‘flip a coin’, i.e., independently and 
uniformly choose a random sign Si G {— 1,4-1}. Let x ■ ^ ^ {“!)+!} denote 
the union of the SiXPi, that is, we have x{x) = £iXPi{P) for ^ ^ Pi- We call 
X a structured random coloring with respect to P and the XPi P ^ [r] • Here is a 
discrepancy estimate for such a coloring. 

Lemma 1. Let y &e o structured random coloring with respect to V and the 
XPi P S [r] . For any hyperedge E G £ we have 

P(|y(P)| > A) < 2e"=o(p.E)i . 

The proof is not very difficult. We still state it here, as it reveals why our 
structured random colorings are superior to the ordinary ones. 

Proof. For each i G I{P,E) define a random variable Zi = xpAE FI Pi) = 
HxeEnPi XPii.x). Set Z = J2i^i{v,E) ^ote that Z = y(P). Since the Z,, are 
mutually independent —1, 1 random variables, we may apply the Chernoff bound 
(cf. fAS00| . Corollary A. 1.2) and get 

P(|Z|>A)<2e 21/(7’, E)i, 



□ 
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Comparing Lemma Q with the analogous estimate for ordinary random col- 
orings 

P{\X{E)\> \) <2e~^ , 

we see that in our version we replaced the cardinality \E\ of the hyperedge by 
the possibly smaller number of Pi such that xpAE fl Pi) yf 0. We thus reduced 
the relevant size of the hyperedges. 

There is a second way structured random colorings can improve discrepancy 
bounds, namely by reducing the number of relevant hyperedges. Set 

E-p :=\J{{EnP,)\xpdEnP,)^0} 



for all E G £ and £p := {Ep\E G £}. From the definition of structured random 
colorings it is clear that any structured random coloring y with respect to P and 
the Xp,P G P fulfills x{E) = x{Ep). In particular, we have 

disc(?^,x) = disc((W^’-p),x)- 

Depending on the partition P and the colorings the mapping E i— >■ Ep 
is not injective, and hence \£p\ < \£\. In this case we only need to consider the 
smaller number \£p \ of hyperedges. Since the discrepancy bound depends on the 
number of hyperedges just logarithmically, this effect is less important compared 
to the reduction of the relevant sizes of the hyperedges. It can however be useful, 
as it makes the computation of disc('H,x) easier. 

This observation together with Lemma Q yields 

Theorem 2. Let sq := maxpg^ \I(P,E)\ and mo '■= \£v\- Then a struetured 
random eoloring with respeet to P and the xp^P ^ P diserepancy at most 

disc('H,x) < \/2so ln(4mo) 



with probability at least ^ . 

Proof. Omitted. □ 

There are two more points to add concerning structured random colorings. 
One is that we may get information about x through properties of the colorings 
Xp,P G P- For example, if each xp^P £ P Fas equal-sized color classes, then 
this also holds for x- Conversely of course, we may enforce certain properties on 
X by choosing suitable colorings xp, P GP. 

Secondly, from the definition of structured random colorings it is clear that 
to generate a structured random coloring with respect to P and XPjP ^ Pj 
need only \P\ random bits instead of n random bits needed for ordinary random 
colorings. 
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3 Higher-Dimensional Boxes 

In this section we show how the method described above can be applied to an 
actual example, namely hyper graphs of higher-dimensional boxes. They display 
some regularity that can be exploited. On the other hand, this is still a fairly 
general class of hypergraphs. For similarly geometrically defined hypergraphs, 
so-called cylinder intersections, a discrepancy result was used to prove bounds 
on multi-party communication complexities by Babai et al. [BHK98) . 

We say that a hypergraph T~L = (X, £) is a hypergraph of d-dimensional 
boxes for some d G N, if there is a decomposition X = Xi x ■ ■ ■ x X^ such that 
each hyperedge E G £ has a representation E = Ei x ■■■ x Ed respecting the 
decomposition of X, i.e., such that Ei C Xi holds for all i G [d]. Let us agree to 
call any set E\ x ■ ■ ■ x Ed such that Ei C Xi for all i G [d] a, box. 

For an arbitrary number r we denote by \r ~\2 the smallest even integer not 
being smaller than r. We show 

Theorem 3. Let T-L = (X, £) be a hypergraph of d-dimensional boxes. Let X = 
Xi X • • • X Xd be a eorresponding decomposition. Set n := \X\, Ui := \Xi\ for i G 
[d] and m := \£\. Then there are structured random colorings y : X — >■ {—1,1} 
having discrepancy at most 

disc{n,x) < r«dl2ln(4m) 

with probability at least Generating these structured random colorings needs 
2~‘^n random bits. 

Note that Theorem Q using ordinary random colorings only proves a bound 
of ■ ■ - Ud ln(4m) = ^/2nln{im). This is worse by a factor of 2“^/^ (in the 

case of even Ui). 

Proof. Without loss of generality we may assume that Xi = [ui]. We first con- 
sider the case that all n^, * = 1 , . . . , d, are even. 

Set V := {{2x\ — 1, 2xi| x • • • x {2xd — 1, 2xd} \ Vi G [d\ : Xi G [^]|, that is, 
we partition the vertex set into small cubes of size 2 ^^ in a rather canonical way. 

The coloring corresponding to each small cube shall be such that adjacent 
(in the Hamming distance sense) corners always receive opposite colors. More 
formally, for a given cube P G V we define a coloring xp ■ P 1} by 

Xp{x) = 1 4=^ E Xi is even 

ie [d] 



for all X = {xi , . . . , Xd). 

Let E G £ and P G V. As both E and P are boxes, so is if fl P. From 
the definition of xp we see that any subbox S of P such that jS”! 7 ^ 1 fulfills 
x(S') = 0. Hence \xp(E FI P)| < 1 for all if G f and P G V. We may therefore 
define random structured colorings with respect to V and the xp , P G V as 
introduced in Section |21 Let y be such a coloring. 
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We have \I{V,E)\ <\V\=2 ^n. Applying Theorem 0 with sq = 2 we 
get the bound 

disc('H,x) < \/n ln(4m), 

which finishes the proof in the case that all i = 1, . . . , d, are even. 

For the general case we consider the hypergraph 'Hi — ([|"ni] 2 ] x ••• x 
2 ]) ^)- Since H is a subhypergraph of Hi, any coloring xi for Hi by re- 
striction yields a coloring y = (xi)|x for H. The claim follows from disc("H, y) < 
disc('Hi,yi) and applying the case of even cardinality sets to Hi. □ 

Apart from this improved discrepancy bound, we also gained some informa- 
tion about the coloring itself. For example, all geometric boxes are colored very 
nicely. We call a box B Q X a geometric box, if it can be represented in the form 
B = Ii X ■ ■ ■ X Id for some intervals A C [rii],i G [d]. As can be seen easily, these 
boxes fulfill |y(S)| < 2‘^ for any structured random coloring y with respect to 'P 
and yp, P GV. 

Furthermore, our colorings are fair, that is, they are perfectly balanced on the 
whole vertex set. We have y(Al) = 0, if |A| is even, and y(Al) G {—1, 1}, if |A1| 
is odd (note that any odd cardinality set S cannot have discrepancy y(<5') = 0, 
no matter what the coloring y is like). 

Fair colorings are important in recursive algorithms and divide-and- 
conquer procedures. The relation between combinatorial discrepancies and e- 
approximations (and thus also the “transfer principle” connecting geometric 
and combinatorial discrepancies) rely on the concept of fair colorings. We re- 
fer to the first chapter of Matousek jlVI atilt)] for the details. Another example 
is the recursive method to construct balanced multi-colorings from 2-color dis- 
crepancy information (cf. jlJSOlj ). 

If X G £, then fairness can be obtained by recoloring some vertices in the 
larger color class. This increases the discrepancy by a factor of at most 2. With 
our structured random colorings, we can get fairness “for free” . 

To show how such structural knowledge about the random coloring can be 
used to reduce the number of relevant hyperedges, we examine a special class of 
box hypergraphs: The hypergraph of all d-dimensional boxes in [nnY for some 
no G N is H^g := ([no]'^, {S'! x • • • x Sd\Si C [no]}). The usual random colorings 
(Theorem [Q fulfill 



disc{Hf,^) < y^2np ln(4 2"od) 

= -\/21n2ng^ \/d (1 -I- o(l)) 
a; l.lSn,^ \/d(l-ko(l)) 

with probability at least In the following theorem we improve this bound 
and also show that less than of the 2"“^^ hyperedges are relevant. For 

convenience let us concentrate on the case that no is even. The general result 
can be obtained from similar reasoning as in the proof of Theorem 0 
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Theorem 4. Let nQ,d G N, uq even, d>2 and set n := Uq. There are struetured 
random eolorings x for TLno ^^o,t have 

disc{Ht^,x) < 1.05 Vd 

with probability at least ^ . Generating these colorings needs 2~^n random bits. 
To compute their discrepancy, only hyperedges have to be regarded. 

Proof. Set V := {{2xi — 1, 2x\} x • • • x {2xd — 1, 2xd} \ x\, - ■ ■ ,Xd S [^]} and 
define xp> P S P as in the proof of Theorem 0 Let x be a random coloring with 
respect to V and xp^P £ P- As above we have \I{V,E)\ < 2~^Uq. 

Now let us bound the number of hyperedges that are relevant for the dis- 
crepancy of X with respect to H. We first compute \£v\- Let E = Si x ■ ■ ■ x Sd. 
Assume that for some i G [d] and x G [^] we have {2a;— 1, 2a;} C Si. Then no box 
P = {2a;i — 1, 2a;i| x • • • x {2xd — 1, 2xdj such that Xi = x intersects E in exactly 
one vertex. From some elementary properties of boxes and the definition of xp 
we derive xp(PnP) = 0. Thus E-p = {S\ x • • • x (Si\{2x — 1,2a;}) x • • • x Sd)v. 
By induction we see that tt : if H> Ep is a projection of £ onto £. Therefore we 
need to count its fixed points only to get \£p\. We just exhibited that a necessary 
(and sufficient) condition for a hyperedge if = Pi x • • • x to be a fixed point 
is 

Vie [d]Va;e [^] : |Pi n |2x - 1, 2a;}| < 1. 

For each i G [d], a; e [^] we therefore have exactly three possibilities: Si fl (2a; — 
1,2a;} is empty or (2a; — 1} or {2a;}. This makes \£p \ — S’^od /2 f^^ed points. 

Still, not all hyperedges in £p are relevant. From the structure of x we derive 
a further reduction: Note that for all i G [d], 

7* : Pi X • • • X X • • • X Pd Pi X • • • X ([no] \Si) x ■■■ x Sd 

is a fixed-point-free bijection of £ that leaves the set £p of reduced hyperedges 
invariant and preserves discrepancy: We have 

X{E) = -xhi{E)) 

for all hyperedges if G P. In particular, the group (71 , . . . ,Xd) — ^2 ^^ts on £ 
and £p in such a way that all orbits have length 2'^. As all elements of an orbit 
have the same discrepancy with respect to x> it is enough to consider just one 
representative from each orbit. Let £q C £ he system of representatives of the 
orbits in £p, that is, Pq contains exactly one element of each orbit in £p. Since 
[Pol = 2“‘^|P73|, we reduced the number of relevant hyperedges by another factor 
of 2‘^. From Theorem 0 we finally get (with probability at least |) 

disc(ii, x) = disc((X, Pq), x) 

< y^2 2-‘^n[(ln(4 2-^3"o^/2) 

d+1 

< Vhr3 2-^A„^2 

< 1.05 2-P'^no'^y/d. 

□ 
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We should remark that the size reduction yields a change in the order of 
magnitude in terms of d, namely the additional factor, whereas counting 

the relevant edges (less than (7/8)" of the total number of edges) only improves 
the constant by about 11%. Recall however, that reducing the number of relevant 
hyperedges does reduce the complexity of checking whether a structured random 
coloring fulfills the discrepancy bound of the theorem or not. 

4 Summary and Conclusion 

In this paper we presented a new way of generating random colorings for the 
discrepancy problem of hypergraphs. This allows to use structural information 
about the hypergraph and thus 

— improves discrepancy guarantees, 

— allows to prescribe additional properties regarding the coloring, e.g. fairness, 

— reduces the number of random bits needed to generate the coloring, 

— reduces the number of relevant hyperedges, and thus reduces the complex- 
ity of computing the discrepancy of the random coloring and the expected 
complexity of computing a low-discrepancy random coloring. 

Since generating random colorings for a discrepancy problem is equivalent to 
generating random roundings for the trivial solution of the corresponding ILP- 
relaxation, we believe that these methods can be applied to a broader range of 
ILPs as well. 
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Abstract. We study the problem of scheduling n jobs that arrive over 
time. We consider a non-preemptive setting on a single machine. The 
goal is to minimize the total flow time. We use extra resource competitive 
analysis: an optimal off-line algorithm which schedules jobs on a single 
machine is compared to a more powerful on-line algorithm that has £ ma- 
chines. We design an algorithm of competitive ratio 0(min(Z\^/^, 
where A is the maximum ratio between two job sizes, and provide a lower 
bound which shows that the algorithm is optimal up to a constant factor 
for any constant £. The algorithm works for a hard version of the prob- 
lem where the sizes of the smallest and the largest jobs are not known 
in advance, only A is known. This gives a trade-off between the resource 
augmentation and the competitive ratio. 

We also consider scheduling on parallel identical machines. In this case 
the optimal off-line algorithm has m machines and the on-line algorithm 
has £m machines. We give a lower bound for this case. Next, we give 
lower bounds for algorithms using resource augmentation on the speed. 
Finally, we consider scheduling with hard deadlines. 



1 Introduction 

Minimizing the total flow time is a well-known and hard problem, which has 
been studied widely both in on-line and in off-line environments nog. The 
flow time f(J) of a job J is deflned as its completion time, C{J), minus the time 
at which it arrived, r(J) (the release time of J). This measure is applicable to 
systems where the load is proportional to the total number of bits that exist in 
the system over time (both of running jobs and of waiting jobs). In this paper, 
we consider on-line algorithms using resource augmentation, and we examine the 
effects on the performance of an algorithm if it has more or faster machines than 
the off-line algorithm (see 00). 

We consider the following on-line scheduling problem. The algorithm has par- 
allel identical machines, on which it must schedule jobs with different processing 
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requirements that arrive over time. A job J (which arrives at time r( J)) with 
processing requirement P(J) (also called running time or size) that becomes 
known upon arrival, has to be assigned to one of the machines and run there 
continuously for P(J) units of time. The objective is to minimize the sum of 
flow times of all jobs. The total number of jobs is n. 

We compare on-line algorithms that are running on £m machines {£ > 1) to 
an optimal off-line algorithm, denoted by OPT, that is running on m machines 
but knows all the jobs in advance. Such on-line algorithms are also called £- 
machine algorithms, since they use £ times as much machines as the optimal 
off-line algorithm. An algorithm that uses the same number of machines as the 
off-line algorithm, but uses machines which are s > 1 times faster, is called a 
s-speed algorithm. 

For a job sequence a and an on-line algorithm A, we denote the total flow 
time of cr in the schedule of A on £m machines by A^m(o’). We denote the optimal 
total flow time for a on m machines by OPT^^a). The competitive ratio using 
resource augmentation is defined by 



rTn,em{A) = sup 



A^rni^') 
OPT^(a) ’ 



where the supremum is taken over all possible job sequences cr. The goal of an 
on-line algorithm is to minimize this ratio. 

Approximating the flow time is hard even in an off-line environment (see [3 
Ej). In an on-line environment it is well known that the best competitive ratio 
of any algorithm that uses a single machine is n (easily achieved by a greedy 
algorithm) . The problem has been studied introducing resource augmentation by 
Phillips, Stein, Torng and Wein |0|. They give algorithms with augmentation on 
the number of machines. These are an 0(logn)-machine algorithm (which has a 
competitive ratio 1 -I- o(l)) and an 0(log Z\)-machine algorithm (which achieves 
the competitive ratio 1), where A is the maximum ratio between running times 
of jobs. Both algorithms are valid for every m. 

We give an algorithm Levels and show ri^£(Levels) = 0(min(n^/^, Z\^/^)), 
where n is the number of jobs that arrive. This algorithm works for a hard 
version of this problem where the sizes of the smallest and the largest jobs are 
not known in advance; only A is known in advance. The algorithm in [2( works 
only if the job size limits are known in advance. 

Furthermore, we show that for all on-line algorithms A and number mi of off- 
line machines we have ^(A) = fi ■ This shows that Levels is 

optimal up to a constant factor for any constant I against an adversary on one 
machine. 

In a related problem on a network of links is considered. It immediately 
follows from our lower bounds, that any constant competitive algorithm has a 
polylogarithmic number of machines. More precisely, if A has a constant compet- 
itive ratio and £m machines, £ > f2 ( — \ ^ result can also 

log(min(n,z\)) J 

be deduced from Theorem 10 in However, using their proof for the general 
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lower bound would give only an exponent of Improving the exponent to be 
the tight exponent | is non-trivial. Our results imply that by choosing a given 
amount of resource augmentation, the competitive ratio is fixed. We adapt the 
lower bound for the case where the on-line algorithm has faster machines than 
the off-line algorithm. This results in a lower bound of ) on the speed 

of on-line machines, if £ = 1 . 

We also consider the following scheduling problem studied in |3I9| . Each job 
J has a deadline d{J). Instead of minimizing the flow time, we require that 
each job is finished by its deadline, effectively limiting the flow time of job J 
to d{J) — r{J). The goal is to complete all jobs on time. For this problem, 
we give lower bounds on the speed and the number of machines required for a 
non-preemptive on-line algorithm to succeed on any sequence. 

Throughout the paper, for a specific schedule ( for the jobs, we denote the 
starting time of job J by and its flow time by f({J) = Cq{J) — r{J). We 

omit the subscripts if the schedule is clear from the context. 

2 Algorithms with Resource Augmentation 

We have the following results for the case where n is not known and the case 
where OPT has the same number of machines as the on-line algorithm. 

Lemma 1. Any on-line algorithm for minimizing the total flow time on parallel 
maehines has a eompetitive ratio of if it does not known in advance, even 
if it is compared to an off-line algorithm on one machine. 

Proof. We use a number >> 1. 

One job of size 1 arrives at time 0. When it is started, N jobs of size 1/iV 
arrive with intervals of 1/iV during the next 1 time. If they are all delayed until 
time 1, no more jobs arrive and we are done. The optimal flow time on one 
machine is 3 and the online flow time is 0{N). 

On the other hand, if one of those jobs is started while the first job is running, 
N jobs of size 1/fV^ arrive with intervals of 1/iV^ during the next 1/N time. 
Depending on the decision by the online algorithm, we continue in this way or 
stop as soon as it delays N jobs (or reaches the last machine). 

When all machines are in use, the online algorithm cannot prevent a flow 
time of 0{N). □ 

Lemma 2. rg^£{A) = for all algorithms A. 

Proof. A single unit job arrives at time 0. Let t be the time at which A starts 
this job. Let p. = 2 (n-i) ■ J = 0; ■ ■ • i ^ jobs of length p are released at 
time t-\-jp. It is easy to see that the optimal total flow time is 0{£) whereas the 
flow time of A will be f2{n/t). Consequently, ri^e{A) = f2{nf£'^). □ 

We define an algorithm Levels that knows n. Levels uses £ priority queues 
Qi, . . . ,Qe (one for each machine) and £ variables Di > . . . > Di. We initialize 
Qi = 9 and Di = 0. An event is either an arrival of a new job or a completion 
of a job by a machine. Let 7 = where n is the number of jobs. 
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Algorithm Levels 

— If a few events occur at the same time, the algorithm first deals with all 
arrivals before it deals with job completions. 

— On completion of a job on machine i, if Qi ^ 0, a job of minimum release time 
among jobs with minimum processing time in Qi is scheduled immediately 
on machine i. (The job is dequeued from Qi.) 

— On arrival of a job J, let z be a minimum index of a machine for which 
Di < jP{J). If there is no such index, take i = m. If machine i is idle, J is 
immediately scheduled on machine i, and otherwise, J is enqueued into Qi. 
If P{J) > Di, Di is modified by Di t— P{J). 

We analyze the performance of Levels compared to a preemptive OPT on 
a single machine. Denote the schedule of Levels by tt. Partition the schedule 
of each machine into blocks. A block is a maximal sub-sequence of jobs of non- 
decreasing sizes, that run on one machine consecutively, without any idle time. 

— Let Ni be the number of blocks in the schedule of Levels on machine i. 

— Let Bi k be the block on machine i. 

— Let bi^k,j be the job in block Bi^k- 

— Let Ni^k be the number of jobs in Bi k. 

— Let Pi^k be the size of the largest job in blocks . . . , Bi^k i-e. 

Pi k = max max P(biri) 

’ l<r<kl<j<Ni,r ’ 

Pifi = 0 for all 1 < z < i. 

— Let I = Ui<i<£ Kk<N Pi,ki i-e. I is the set of all jobs. 

Similar to the proof in |^ , we define a pseudo-schedule 'ijj on £ machines, in which 
job is scheduled on machine i at time ST^{bi^k,j) — Pi,k-i- Note that ijj is 
not necessarily a valid schedule, since some jobs might be assigned in parallel, 
and some jobs may start before their arrival times. 

The amount that jobs are shifted backwards increases with time. Therefore, 
if there is no idle time between jobs in tt, there is no idle time between them in 
z/) either. Note that in z/>, the flow time of a job J can be smaller than P{J), and 
even negative. 

We introduce an extended flow problem. Each job J has two parameters 
r{J) and r'(J), where r'{J) < r{J). r'{J) is the pre-release time of job J. Job J 
may be assigned starting from time r'{J). The flow time is still defined by the 
completion time minus the release time, i.e. f{J) = C{J) — r{J). Going from an 
input a for the original problem to an input a' of the extended problem, requires 
definition of the values of r' for all jobs. Clearly, the optimal total flow time for 
an input a' of the extended problem is no larger than the flow time of a in the 
original problem. 

Let li be the set of jobs that run on machine z in tt. We define an in- 
stance I'i for the extended problem. /' contains the same jobs as Q. For each 
J G h, r{J) remains the same. Define r'{J) = min{r( J), S'^( J)}. Clearly 
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OPT{I) > Y^l^iOPT{Ii) > Yli=iOPT{I-), where OPT{Ii) is the preemp- 
tive optimal off-line cost for the jobs that Levels scheduled on machine i. We 
consider a preemptive optimal off-line schedule 4>i for I[ on a single machine. In 
4>i, jobs of equal size are completed in the order of arrival. Ties are broken as in 
7T. The following lemma is similar to |^. 

Lemma 3. For each j oh J G I'i, 

Proof. Since r^^{J) = r.,j,{J) for each job J, we only have to show that in 
J does not start earlier than it does in if. Assume to the contrary this is not 
always the case. Let Ji be the first job in (fi for which (Ji) < S'^(Ji). Note 
that in this case r'{Ji) < and hence r{Ji) < S^{Ji). Let t be the end of 

the last idle time before S'^(Ji), and let Bi^k be the block that contains J\. 

Suppose Pi^k-i < Then all jobs that run on machine i from time t 

until time in if are either smaller than P{J\) or have the same size, but 

are released earlier. Moreover, these jobs do not arrive earlier than time t, hence 
in (fi they do not run before time t. They do run before because they 

have higher priority, hence (Ji) > S'^(Ji), a contradiction. 

Suppose Pi^k-i > Ji was available to be run in if during the inter- 

val [r{Ji), Sjjj{Ji)] since r(Ji) < S.^{Ji). In tt, all jobs running in the interval 
[r(Ji), Ji)] are smaller than Ji (or arrived before, and have the same size), 
except for the first one, say J 2 . Since in if, all these jobs are shifted backwards 
by at least the size of J 2 , during [r( Ji), Ji)] only jobs with higher priority 
than Ji are run in if. J\ is the first job which starts later in if than it does in (fi, 
so these jobs occupy the machine until time S'^(Ji), hence □ 

Theorem 1. ri^^(Levels) = 0(n^/^). 

Proof. Using Lemma 0 we can bound the difference between if and tt. Since 
Levels(6j_fcj) = C.^{bi^k,j) + Pi,k-i ~ r{bi^kj), we have 

Levels(/) = {C.tp{b,^k,j) + Pi,k-i ~ r{bi^k,j)) 

l<i<e l<k<Ni l<j<Ni^k 

— ^ ^ {P'tp(,^i,k,j') "k{bi k,j^^ T ^ ^ Pi,k — 1 
<OPT{I)+ 

Let P the maximum job size. We show the following properties: 

Pi,k-i < jP{h,k,j) for each job h^kjA < * < - 1 
P 

Pi,k-i < yTY for each job be^k,j 

Adding both properties together we get 

Levels(/) < OPT{I) + X! P^,k-i + X! 



( 2 . 1 ) 

( 2 . 2 ) 
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<opT{i)+j Y. E i 

< OPT{I) + ^OPT{I) + n ■ 

This holds since OPT{I) > P and OPT{I) is at least the sum of all job sizes, 
and since |/| = n. 

To prove (O) we recall that bi^kj was assigned to machine i because it 
satisfied Di < 7 P( 6 i,fej). If Pi^k-i < P{bi,k,j) we are done. Otherwise the job 
of size Pi^k-i arrived before bi^k,j and hence when bi^k,j arrived, Di satisfied 
Di P Pi^k— 1 ^ hence Pi k—i ^ Di ^ 'y Pi^bi k^j') ■ 

To prove dZ 2 | we show by induction that every job J on machine i in Levels 
satisfies P{J) < This is trivial for i = 1. Assume it is true for some 

machine i > 1, then at all times Di < Pj^''~^ holds. Hence, a job J' that was 
too small for machine i satisfied P{J') < Pi/y < PH''- This completes the 
proof. □ 

We give a variant of Levels with a competitive ratio which depends on A, 
the ratio between the size of the largest job and the size of the smallest job. 
Algorithm Revised Levels: Run Levels with y = . 

Theorem 2. riy (Revised Levels) = 0(^AAP'). 

Proof. The proof is very similar to the proof of Theorem D The only difference 
in the proof is that property il'Z. Ill also holds for machine i (this follows from 
property ll'Z.'Zll and the definition of Z\), hence the competitive ratio is now y+ 1. 

□ 

Taking y = min(n^/^, we can get a competitive ratio of 0(min(n^/'^, 

3 Lower Bounds for Resource Augmentation 

Theorem 3. Let A be an on-line scheduling algorithm to minimize the total 
flow time on i machines. Then for any 1 < mi < I and sequences consisting of 
P(^) j^bs, r^^^^(A) D ^ ^ ■ 

We first describe a job sequence a and then show that it implies the theorem. 
Let n be an integer. There will be at most ™ ^ (note = 

0(n)). We build a recursively, defining the jobs according to the behavior of the 
on-line algorithm A. 

Definition A job j of size a is considered active., if the previous active job of size 
a is completed by A at least a units of time before j is assigned, and j finishes 
before or when the job that caused its arrival finishes. 

The first job in cr has size n and arrives at time 0. We consider it to be an 
active job. On an assignment of a job j of size a by A, do the following: 
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— If j is active, and all other machines are running larger jobs (all machines 
are consequently busy for at least a units of time), n jobs of size 0 arrive 
immediately. No more jobs will arrive. 

— Otherwise, if j is active, then j causes the arrival of jobs of size ^ • -f/i- 
These jobs arrive starting the time that j is assigned, every — units of 
time, until they all have arrived. 

— In all other cases {j is not active), no jobs arrive till the next job that A 
starts. 

Lemma 4. OPTi{a) < 6n. 

Proof. We show that all jobs can be assigned on a single machine, during an 
interval of length 2n, so that a job of length a has a flow time of at most 3a. 
The total flow time then follows. 

We show how to assign all jobs of a certain size a so that no active jobs of 
size a are running at the same time on on-line machines, i. e. the intervals used 
by A to run active jobs of size a, and the intervals that are used by OPT to 
run jobs of size a, are disjoint. Smaller jobs are assigned by OPT during the 
intervals in which A assigned active jobs of size a. Hence, the time slots given 
by the optimal off-line for different jobs are disjoint. 

Finally, we show how to define those time slots. A job j of size a, that arrives 
at time t, is not followed by other jobs of size a until time t + Sa. Since an active 
on-line job starts at least a units of time after the previous active job of this 
size (a) is completed, there is a time slot of size at least a during the interval 
[t, t + Sa] where no active job of size a is running on any of the on-line machines. 
The optimal off-line algorithm can assign j during that time. This is true also 
for the first job. Finally, the optimal algorithm can also manage the jobs of size 
0 easily by running them immediately when they arrive. Hence, the total time 
that the optimal off-line machine is not idle is at most 2n. □ 

We partition jobs into three types, according to the on-line assignment. A 
job that arrived during the processing of a job of size a, and has size 
is either active or passive (if it is not active, but completed before the job of 
size a is completed). Otherwise, the job is called late. Let P{a),T{a) and L{a) 
denote the number of passive, active and late jobs of size a (respectively). Let 
N{a) = P{a) +T{a) + L{a). 

Claim. T(a) > |"^(P(a) -|- T(a))] 

Proof. The number of jobs of size a that the on line algorithm can complete 
during 2a units of time (until a job can be active again) is at most 2n. □ 

Now we are ready to prove Theorem 01 

Proof. (Of Theorem 01 ) According to the definition of the sequence, N{a) = 
r^i/i . 7 "( 3 q. 7 j 1 /^), We distinguish two cases. 

Case 1. In all phases T{a) < ^N(a). Hence T{a) > ^N(a) for all a. This 
is true for a = (|)^“^n^/^ (the smallest non-zero jobs) and hence there at least 
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^>0 such jobs. Therefore the zero jobs arrive and are delayed by 
at least {^Y~^ -n^l^ units of time. Since their flow time is at least 
and the optimal flow time is at most 6n, the competitive ratio follows. 

Case 2. There is a phase where L{a) > ^N{a). Consider the phase with 
largest a in which this happens. Since for larger sizes a' we have L{a') < ^N{a'), 
we can bound the number of jobs of size a (for a = by N{a) > 

The late jobs are delayed by at least | on average. (This is 

the delay if for each job of size ZarY!^ the last \vY!^ jobs of size a that arrive 
are the ones that are late; in all other cases, the delay is bigger.) 

The total flow time is at least 



> L{a) ■ - • ZavY!^ > -n 



i/i 



(4£)i 8 Vs 



i-l 
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Since the optimal flow is 0{n), the competitive ratio follows. 



□ 



Theorem 4. Let A be an on-line scheduling algorithm to minimize the total 
flow time on I machines. Then = 17 ^ (i 2 t.Y ) ^ ^ 

maximum ratio between jobs is A. 

Proof. We adjust cr by starting with a job of size A and fixing n = A/Z^. We 
assume Z\ > 6^ so that n > 2^ and > 2, which is needed for the construction 
of the sequence. 

Starting from here, we build a sequence a' in exactly the same way as a, 
except that we do not let jobs of size 0 arrive. Clearly, OPTfla') < 6 A. We can 
follow the proof of Theorem 0 However, we now know that all the smallest jobs 
will be late. If they arrive we are in the second case of the proof; but if they do 
not, then for an earlier a we must have L{a) > ^N{a). So only Case 2 remains 
of that proof. 

The total flow is at least ^-(j^A = | ^ (because now i < £ in stead of 

i < i — 1), giving the desired competitive ratio. □ 

A direct consequence of Theorems[3and0is the following bound on the num- 
ber of machines needed to maintain a constant competitive ratio. This corollary 
can be also proved using a simple adaptation of Theorem 10 in 



Corollary 1. Any on-line algorithm for minimizing total flow time on m ma- 
chines that uses resource augmentation and has a constant competitive ratio, is 

an f2 ( — \ algorithm (on sequences of 0{n) jobs). 

\^m.^log log(min(n,Zi)) J 
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Next we consider resource augmentation on the speed as well as on the num- 
ber of machines. We consider an on-line algorithm which uses machines of speed 
s > 1. The optimal off-line algorithm uses machines of speed 1. 

Theorem 5. Let A be an on-line scheduling algorithm to minimize the total 
flow time on I machines. Let s > 1 &e the speed of the on-line machines. Then 
Tmi,t{,A) = fl 1 ^ sequences consisting of 0{n) 

jobs. Furthermore, rrm,e{A) = L2 for any 1 < mi < £. 

Proof. Again, we use a job sequence similar to a. The jobs of phase i now have 
size l/(3s^n^/^)b For the Z\-part of the proof, we fix n = Z\/(3s^)^. Similar 
calculations as in the previous proofs result in the stated lower bounds. □ 

Corollary 2. Any on-line algorithm for minimizing total flow time on m ma- 
chines that uses resource augmentation on the speed and has a constant compet- 
itive ratio, is an '>)-speed algorithm (on sequences of 0{n) jobs) and an 

Q[A^/ '^‘^^‘^'>)-speed algorithm. 



4 Hard Deadlines 

We consider the problem of non-preemptive scheduling jobs with hard deadlines. 
Each arriving job J has a deadline d{J) by which it must be completed. The 
goal is to produce a schedule, in which all jobs are scheduled such that all of 
them are completed on time (i.e. by their deadlines). We give a lower bound 
on the resource augmentation required so that all jobs finish on time. We use a 
similar lower bounding method to the method we used Section El We allow the 
on-line algorithm resource augmentation in both the number of machines and 
their speed. We compare an on-line algorithm that schedules on £ machines of 
speed s to an optimal off-line algorithm that uses a single machine of speed 1. 

Let A denote the ratio between the largest job in the sequence and the 
smallest job. The lower bound sequence consists oi I -\- I jobs Jq, ... ,Ji where 
P{Ji) = l/(2s-|- 1)*. We define release times and deadlines recursively; r{Jo) = 0 
and d(Jo) = 2 -|- 1/s. Let tt be the on-line schedule, then r{Ji^i) = 5',r(Ti) and 
d( Ji+i) = CTr{Ji). Hence J^+i runs in parallel to all jobs Jq, . . . ,Ji in any feasible 
schedule tt. 

Lemma 5. An optimal off-line algorithm on a single machine of speed 1 can 
complete all jobs on time. 

Proof. For each j > 0, P{Ji) = l/(2s -I- 1)*, hence d{Ji) — r{Ji) = P(Ji_i)/s = 
^^^P{Ji). This holds also for Jq, since P{Jo) = 1 and d(Jo) — r(Jo) = All 
jobs arriving after Ji have release times and deadlines in the interval [S'^(Ji), 
CTr{Ji)]. The optimal off-line algorithm can schedule Ji outside this time interval, 
and avoid conflict with future jobs. By induction, previous jobs are scheduled 
before r{Ji) or after d{Ji), so there is no conflict with them either. If S',n.(Ji) — 
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T{Ji) > P{Ji), schedule Ji at time r{Ji). Otherwise C^(Ji) = ST^{Ji)+P{Ji)/ s < 
r{Ji) + P{Ji){l + 1/s), hence Ji is scheduled at time C.,r{Ji) and completed at 
C^{Ji) + P{Ji) < r{Ji) + P{Ji){2 + 1/s) = d{Ji). 

It is easy to see that the on-line algorithm cannot finish all jobs on time. If the 
first (. jobs finish on time, then all £ machines are busy during the time interval 
[r{Ji),d{Ji)] and it is impossible to start Ji before time d{Ji). We omit the proof 
of the following theorem: 

Theorem 6. The on-line algorithm fails, if A> (2s -I- 1)^. 

We show some corollaries from the lower bound on A. These are necessary 
conditions for an on-line algorithm to succeed on any sequence. Given machines 
of constant speed s, the number of machines i must satisfy £ > i-e. 

£ = l7(log A). On the other hand, for a constant number £ of machines, s has to 
satisfy 2s -I- 1 > AJP , i.e. s = fl^AJP'). 

The lower bound on A clearly holds also for the case where the optimal 
off-line algorithm is allowed to use mi > 1 machines. Consider a fc-machine 
algorithm that always succeeds in building a feasible schedule (k = ijm-i), then 
k satisfies k = 0(log Z\/m) for constant s. Finally, s satisfies s = f7(Z\^/'"^) for 
constant k. 

5 Conclusions and Open Problems 

We have presented an algorithm for minimizing the flow time on £ identical 
machines with competitive ratio 0{min(A^P against an optimal off- 
line algorithm on a single machine, and we have shown a lower bound of 
£2 ^ on the competitive ratio of any algorithm, even against an 

adversary on one machine. For every constant £, this gives an exact trade-off 
between the amount of resource augmentation and the number of on-line ma- 
chines. 

An interesting remaining open problem is to find an algorithm which is op- 
timally competitive against an off-line algorithm on a single machine for any 
£. 
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Abstract. Path problems such as the maximum edge-disjoint paths 
problem, the path coloring problem, and the maximum path coloring 
problem are relevant for resource allocation in communication networks, 
in particular all-optical networks. In this paper, it is shown that maxi- 
mum path coloring can be solved optimally in polynomial time for bidi- 
rected generalized stars, even in the weighted case. Furthermore, the 
maximum edge-disjoint paths problem is proved NP-haxd for complete 
graphs (undirected or bidirected), and a constant-factor approximation 
algorithm is presented. Finally, an open problem concerning the exis- 
tence of routings that simultaneously minimize the maximum load and 
the number of colors is solved: an example for a graph and a set of 
requests is given such that any routing that minimizes the maximum 
load requires strictly more colors for path coloring than a routing that 
minimizes the number of colors. 



1 Introduction 

Resource allocation in communication networks leads to challenging optimiza- 
tion problems concerning paths in graphs. We refer to such problems as path 
problems. For example, if a network is to support bandwidth reservation or 
guaranteed quality of service, resources must be allocated to a connection on 
all links along some path between its endpoints. If the resource requirements 
of individual connections are so large that no two connections can share a link, 
connections must be routed along edge-disjoint paths. Maximizing the number of 
accepted connection requests then leads naturally to the maximum edge-disjoint 
paths problem. 

In all-optical networks with wavelength-division multiplexing, different con- 
nections can use the same fiber link simultaneously if their signals are trans- 
mitted on different wavelengths. Establishing a connection requires reserving a 
certain wavelength on all links along a path from the sender to the receiver 
(assuming that wavelength conversion is not available). Two connections that 
are routed through the same link (edge) must be assigned different wavelengths. 
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Wavelengths are conveniently viewed as colors, and the problem of minimizing 
the number of wavelengths required to establish a given set of connection re- 
quests is called the path coloring problem. If the number of wavelengths is a hard 
constraint and the goal is to maximize the number of accepted connections using 
only the available wavelengths, the maximum path coloring problem is obtained. 

For path problems of this type, no approximation algorithms with good ap- 
proximation ratio are known for general network topologies. Therefore, investi- 
gating these problems for specific classes of graphs (where better ratios can often 
be obtained) is an interesting field of research. In this paper, we present various 
new results concerning path problems in specific classes of graphs. 

Preliminaries. The communication network is modeled as a graph G = {V, E). 
G can be an undirected graph or a directed graph. An important subclass of 
directed graphs are the bidirected graphs. These are directed graphs for which 
(u,v) G E implies {v,u) G E. All of the path problems defined in the following 
can be studied for undirected paths in undirected graphs and for directed paths 
in directed graphs or bidirected graphs. We consider only undirected graphs and 
bidirected graphs. 

A request is given by a pair (u, v) of vertices in G. In the directed case, the 
request {u, v) asks for a directed path from u to in G. In the undirected case, 
the order of u and v does not matter, and the request {u, v) just asks for an 
undirected path with endpoints u and v in G. Now we can define the following 
path problems. 

Path coloring (PC): Given a graph G = (V, E) and a list R of requests in G, 
assign paths and colors to the requests in R such that paths receive different 
colors if they share an edge. Minimize the number of colors used. 
Maximum path coloring (MaxPC): Given a graph G = (V,E), a list R of 
requests in G, and a number k of available colors, compute a subset R' of R 
and assign paths and colors to the requests in R' such that paths sharing an 
edge receive different colors and at most k colors are used. Maximize |i?'|. 
Maximum edge-disjoint paths (MEDP): Given a graph G = {V,E) and a 
list R of requests in G, compute a subset R' of R and assign edge-disjoint 
paths to the requests in R' . Maximize |i?'|. 

Note that MEDP is the special case of MaxPG for k = 1. For MEDP and MaxPG, 
it is also meaningful to consider a weighted version of the problem: each request 
r has a certain weight w^, and the goal is to maximize the sum of the weights 
of the requests in R'. 

For PG, an algorithm is a p-approximation algorithm if it runs in polynomial 
time and uses at most p- OPT colors, where OPT is the number of colors used in 
an optimal solution. For MEDP and MaxPG, an algorithm is a p-approximation 
algorithm if it runs in polynomial time and accepts at least OPT j p requests, 
where OPT is the number of requests accepted in an optimal solution. For the 
weighted version, the definition is analogous. If an algorithm is a p-approximation 
algorithm, we say that it achieves approximation ratio p. 
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For a given list R of requests in a graph G = (V, E), we call any assignment 
of valid paths to the requests in i? a routing. For each routing (j), the maximum 
load L(G, 4>) is the maximum number of paths going through the same edge. By 
L*(G, R) we denote the maximum load of that routing for i? in G that minimizes 
the maximum load. For a routing </>, we denote by x(G, </>) the minimum number 
of colors required to color the paths in <j). By x(G, R) we denote the minimum 
of x{G, 4>) over all routings </> for R in G. 

A star with n nodes is a graph consisting of one node c in the center and 
n — 1 nodes adjacent to c but not adjacent to each other. A generalized star is 
the graph obtained from a star by replacing each edge by a path of arbitrary 
length. These paths are called legs, and generalized stars are also called spiders. 
A complete graph is a graph in which every pair of vertices is joined by an edge. 
The bidirected versions of these graphs are obtained by replacing each undirected 
edge by two directed edges with opposite directions. 

Previous Work. The path problems defined above have been studied inten- 
sively during the past few years, both for arbitrary directed and undirected 
graphs as well as for specific classes of graphs. A comprehensive introduction 
to edge-disjoint paths problems along with several approximation results can be 
found in [12]. For arbitrary directed graphs, it is known that unless P = NP 
no approximation algorithm for MEDP with ratio can exist for any 

e > 0, where m is the number of edges of the graph [10]. 

The MaxPC problem can be solved optimally in polynomial time for undi- 
rected or bidirected chains, because there it is equivalent to the maximum k- 
colorable induced subgraph problem for interval graphs [4]. 

Path coloring has been studied for chains, trees, rings, meshes, and com- 
plete graphs, for example. Surveys of known results can be found in [3,11,8]. For 
bidirected generalized stars, it was shown that PC can be solved optimally in 
polynomial time and that the optimal number of colors is equal to the maxi- 
mum load [7]. For complete graphs, an 0(l)-approximation algorithm for call 
scheduling, a generalization of path coloring, is given in [5]. 

Our Results. In Section 2, we prove that the weighted version of MaxPC can be 
solved optimally in polynomial time for bidirected generalized stars. Bidirected 
generalized stars are the first class of graphs besides chains and bidirected stars 
for which MaxPC is proved to be solvable in polynomial time. In Section 3, we 
show that MEDP is AP-hard for complete graphs, and we give a constant-factor 
approximation algorithm for this problem. In Section 4, we solve an open problem 
that was stated in [3] and [8] by constructing a list R of requests in a bidirected 
graph G such that L* (G, R) = 2 and any routing with maximum load 2 requires 
strictly more colors for path coloring than the routing that minimizes the number 
of colors (for which the maximum load is 3) . 



2 Weighted MaxPC in Generalized Stars 

Let G = {V, E) be a bidirected generalized star with ^ legs and let c denote 
the center of G. Since the path for a request in G is uniquely determined by its 
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endpoints, we assume that the input consists of G, a list P of weighted paths 
in G, and a number k of available colors. The weight of path p G P is denoted 
by Wp. We give an efficient algorithm that computes a subset P' C P satisfying 
L{G,P') < k and maximizing the total weight w(P') = Wp- Since any 

set of paths with maximum load at most k can be colored with k colors in a 
generalized star (see [7]), this set P' gives an optimal solution to the weighted 
version of MaxPC. 

Our algorithm works as follows. We transform the weighted MaxPC problem 
to a minimum cost network flow problem (see [2] for a definition of this problem) 
in a directed network G' = {V , E'). Intuitively, G' is obtained by replacing each 
leg of G by an in-leg and an out-leg, connecting a new source s to the beginnings 
of all in-legs, and connecting the endings of all out-legs to a new target t. An 
additional path edge is added for each path in P. The costs are assigned to the 
edges in such a way that the path edges used by a min-cost flow of value £k from 
s to t correspond to the paths in an optimal solution to weighted MaxPC. This is 
a generalization of the technique used to solve the maximum weight fc-colorable 
subgraph problem for interval graphs in [4] . 

A more formal description of the construction follows. An example is shown 
in Fig. 1 (where G is drawn as an undirected graph for the sake of simplicity). 
Define V' = U U {c, s,t}, where P™ (resp. P°^^*) contains one copy 

(resp. one copy v°“*) of every vertex v G V \ {c}, and s and t are two new 
vertices. Edges of G that are directed towards the center are called inward-edges, 
edges directed away from the center outward-edges. A leg of G with vertices 
vi,V 2 , ■ ■ ■ ,Vj,c (listed from the leaf towards the center) creates an in-leg and an 
out-leg in G". The in-leg consists of the edges («)",«“)) •••) (^)”>c). 

The out-leg has the edges (c, u™*), (u°“*, u°“\), ..., (u™‘,u™‘). In addition, 
we have edges (s,u“) and (u°“‘,t) in G' for all leaves v of G. All edges of G' 
introduced so far are called regular edges and have capacity k and cost 0. 

Finally, we add a path edge Cp to G' for every path p G P. Assume that p 
starts at u and ends at v. If the first edge of p is an inward-edge, the tail of Cp 
is If p starts at c, the tail of Cp is c. If p does not start at c and the first 
edge of p is an outward-edge, the tail of Cp is . The head of Cp is defined in 
an analogous way. Intuitively, this definition ensures that the path in G' from 
the tail of Cp to the head of Cp using only regular edges corresponds to the path 
p in G. The path edge Cp corresponding to path p G P has capacity 1 and 
cost —Wp. 

Lemma 1. The minimum cost of a flow of value £k from s to t in G' is equal 
to the negative of the weight of an optimal solution to weighted MaxPC in G. 

Proof. Note that there is always a minimum cost flow for which the flow on all 
edges is integral. We prove that a solution P' to weighted MaxPC with total 
weight w{P') can be transformed into an integral flow of value £k and cost 
—w{P') in G', and vice versa. Assume that a solution P' to weighted MaxPC 
is given. Assign flow 1 to all path edges Cp for p G P' and flow 0 to all path 
edges 6q ior q G P\P' . It is easy to see that the flow on the regular edges can 
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Fig. 1. The generalized star G and the corresponding flow network G' . 



be adjusted so that a valid flow from s to t of value £k is obtained: the flow 
on edges incident to s or t is set to k, and the flow on one of the other regular 
edges e is set to A: — Le{G,P'), where Le{G,P') is the number of paths in P' 
that contain e. 

Now assume that an integral flow / of value £k and cost G in G' is given. 
Since s and t have £ outgoing resp. incoming edges of capacity k, the flow on each 
of these edges must be equal to k. Intuitively, we have a flow of value exactly 
k flowing into each in-leg and out of each out-leg. Let P' be the set of paths 
corresponding to the path edges with flow 1. Obviously, we have w{P') = —G. 
We claim that L{G,P') < k, implying that P' is a feasible solution to weighted 
MaxPC. Assume for a contradiction that Le{G,P') > k for some edge e of G. 
Assume that e is an outward-edge. (The case for inward-edges is analogous.) 
Then a total amount of flow larger than k reaches the part of the out-leg con- 
taining e that lies between e and the ending of that out-leg. Since only flow at 
most k can flow out of the out-leg, this contradicts the assumption that we had 
a valid flow. Therefore, Lf.{G,P') > k cannot occur, and P' must be a feasible 
solution. □ 

From Lemma 1 and its proof it is clear that weighted MaxPC in bidirected 
generalized stars can be reduced (in linear time) to the min-cost network flow 
problem in acyclic directed graphs with integer capacities. Since the latter prob- 
lem can be solved efficiently (see, e.g., [14]), we obtain the following theorem. 

Theorem 1. There exists an efficient optimal algorithm for weighted MaxPC 
in bidirected generalized stars. 

3 MEDP in Complete Graphs 

We consider MEDP in complete graphs. For a given list R of requests, the goal 
is to select a largest subset of R and connect the pairs in the subset along edge- 
disjoint paths. Multiple occurrence of the same pair in R is allowed; otherwise. 
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the problem is trivial since each request in R can be routed along its direct edge. 
We prove that this problem is iVP-hard, and we propose a simple constant-factor 
approximation algorithm. 

3.1 NP-Completeness 

The problem of deciding for a given list of requests in a graph whether all 
requests can be established along edge-disjoint paths is called the edge-disjoint 
paths problem (EDP). In this section we prove that EDP in complete graphs is 
A^P-complete. As MEDP is a generalization of EDP, this implies that MEDP is 
AP-hard. 

Theorem 2. EDP in complete graphs is NP -complete, both in the undirected 
case and in the bidirected case. 

Proof. First, we give the proof for the undirected case. We present a reduction 
from EDP in arbitrary graphs, which is known to be AP-complete [6], to EDP 
in complete graphs. Let an instance / of EDP in arbitrary graphs be given by 
an undirected graph H = {V, E) and a list R of requests. Take G to be the 
complete graph with vertex set V. Let F denote the edge set of G. Let R' be 
the list of requests consisting of one request (u, v) for every edge {m, u} in F\E. 
Thus, R' contains \F\ — \E\ requests. By Ro R' we denote the concatenation of 
the lists R and R' . Let /' be the instance of EDP in complete graphs given by 
the graph G and the list of requests Ro R'. Obviously, I' can be constructed 
from / in polynomial time. We claim that I' is a yes-instance if and only if / is 
a yes-instance. This is proved as follows. 

Assume that / is a yes-instance, i.e., all requests in R can be routed in 
H along edge-disjoint paths. Then a solution to I' is obtained by routing the 
requests in R in the graph G in the same way as in H and routing each request 
in R' using the direct edge connecting the endpoints of that request. Thus, /' is 
also a yes-instance in this case. 

For the other direction, assume that /' is a yes-instance. Let ^ be a routing 
of the requests in Ro R' along edge-disjoint paths in G. If each request in R' is 
routed using the direct edge connecting the endpoints of that request, the paths 
assigned to the requests in R must be contained in iJ and, therefore, provide a 
solution to I. Now assume that some request {u, v) in R' is not routed along the 
edge {m, v} by the routing (ft. If the edge {m, v} is not used by (j) at all, we can 
simply reroute the request {u,v) to use the edge {u, f}. If some other request r 
is routed through edge {u, w} by 4>, we can replace the edge {u, u} on the path 
assigned to r by the path from m to w assigned to {u, v) by (j) (if the resulting path 
is not a simple path, discard all loops). Then the request (u,v) can be rerouted 
using the edge {u,?^}, and the resulting set of paths is still edge-disjoint. This 
operation can be repeated until all requests in R' are routed along the direct 
connections. Then the argument above can be applied, showing again that / is 
a yes-instance. 

Therefore, the above construction constitutes a polynomial-time reduction 
from EDP in arbitrary graphs to EDP in complete graphs, showing that EDP 
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in complete graphs is A'^P-complete. (It is easy to see that the problem EDP is 
contained in NP.) 

For the bidirected case, a reduction from EDP in arbitrary directed graphs 
(which is also known to be IVP-complete, even if only two requests are given) 
can be obtained in the same way: the requests in R' are used to ensure that the 
requests in R must be routed along edges of the given directed graph. □ 

3.2 A Constant-Factor Approximation Algorithm 

Let G = {V, E) be an undirected complete graph with n nodes. Split V into three 
parts, A, B and C of roughly equal size. For the sake of simplicity, we assume in 
the following that n is a multiple of 6. Then we have |A| = \B\ = \C\ = 

We partition the given list R of requests into three disjoint lists Si = {(s^, ti) \ 
Si, ti G i?V Si G A,ti G B\/ Si G B ,ti G A}, S 2 = {(si, ti) \ Si, ti G CV Si G B,ti G 
CW Si G C , ti G B') and — {(s^,tj) | Si,ti G A\f Si G A,ti G CW Si G C , ti G A}. 
Note that S\ contains no request with an endpoint in C, S 2 contains no request 
with an endpoint in A, and S 3 contains no request with an endpoint in B. We 
will run an algorithm for each list independently and then choose the largest 
of the three solutions. In this way we lose at most a factor of 3, i.e., we get a 
3c-approximation for MEDP in complete graphs provided that the algorithm for 
each list is a c-approximation algorithm. This is because at least one third of 
the paths in the optimal solution must connect requests in the same list. 

In the following, we focus on the algorithm for one list, say S'!, and its 
analysis. Denote with OPT{S\, H) the number of paths in an optimal solution 
for S\ in graph PI and with ALG{S\, P[) the number of paths in the solution 
computed by an algorithm ALG. 

Now transform G into G' = {V ,E') in the following way: merge all nodes 
from C into one node Cm, and let V = A U i? U {cm}- For all u G A U B there is 
an edge {v,Cm} in E' with capacity n. View S'! as a list of requests in the star 
G' , and consider the problem of determining a largest subset S' of such that 
no edge of G' is used by more than n paths. We claim that an optimal solution 
for this problem is at least as large as an optimal solution for S'! in G. 

Lemma 2. OPT{Si,G') > OPT{Si,G) 

Proof. We show that any solution in G can be transformed into a solution in G' 
of the same cardinality. Let (pci^i) be a feasible routing for a subset of S'! in G. 
For each path pi G f>a(Si) that connects some Si and ti, we take the path in G' 
of length 2 from Si to ti with intermediate node Cm ■ All these paths constitute 
a routing (/>g'(Si) in G'. Every node in G has degree n — 1, thus there can be 
at most n — 1 paths in (fc(Si) with the same endpoint. On the other hand, the 
edge capacities in G' are set to n, so we can realize all requests in (/>g(Si) in G' 
without exceeding an edge capacity. Thus |(()g(5'i)| = |</>g'('S'i)|- □ 

Now transform G' into G" by reducing the edge capacities to We claim 
that an optimal solution for Ai in G" is at least a constant fraction of the optimal 
solution for Si in G' . 
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Lemma 3. OPT{Si,G”) > 

Proof. The paths in the optimal solution S* for Si in G' can be partitioned into 
^ classes of edge-disjoint paths in G' . This can be done by edge-coloring the 
corresponding multigraph containing a vertex for every node in A \J B and an 
edge {w,u} for every path with endpoints u and in S'*. This multigraph has 
degree at most n. We apply the following classical result due to Shannon [13] 
on the edge-coloring of multigraphs: Any multigraph can be edge-colored with 
^5 max colors, where 5max is the maximum degree of a node in the multigraph. 
By taking the ^ of these ^ color classes containing the most paths, we obtain a 

solution in G" with cardinality at least OPT{Si,G')^ = OPT{Si,G')/9. □ 

2 

Furthermore, we can show that a simple greedy algorithm achieves approx- 
imation ratio 2 for Si in G" . This greedy algorithm, called GR, considers the 
requests from Si in arbitrary order and accepts each path if its acceptance does 
not exceed the edge capacities. 

Lemma 4. GR{Si,G") > opt(s^uG") 

Proof. Since each path uses only two edges of G", each path accepted by the 
greedy algorithm can block at most two paths from the optimal solution. □ 

Lemma 5. There exists an algorithm computing a solution for S\ in G contain- 
ing at least paths. 

Proof. The proof follows from Lemmas 2-4 and the fact that the output of the 
greedy algorithm for Si in G" can be transformed into a solution in G of the 
same cardinality as follows. Let Greedy (S\, G" ) be the output of the greedy 
algorithm. Consider the requests (s,t) in Greedy(Si,G" ) in arbitrary order. For 
request (s, t), find a vertex v G G such that the edges {s, u} and {u, t} are not yet 
used by any other path, and add (s, t) with assigned path s-v-t to the solution. 
Such a vertex v can always be found, because {s,?^} and {u,t} are only used by 
requests with endpoints s or t, and Greedy ( S i, G" ) contains at most 2(^ — 1) < | 
such requests besides (s,t). Note that G contains | vertices. □ 

Since we have an 18-approximation algorithm for each of the lists S\, S 2 , S 3 
by Lemma 5, we obtain a 54-approximation algorithm for MEDP in undirected 
complete graphs by running the algorithm for the individual lists and taking the 
largest of the three solutions. 

In fact, the problem for S± in G" can be solved optimally in polynomial time: 
if the list Si of requests is transformed into a multigraph M similar to the proof 
of Lemma 3, then the optimal solution in G" corresponds to a subgraph of M 
with maximum degree ^ such that the number of edges in M is maximized. 
This problem is a special case of the capacitated 6-matching problem, which 
can be solved optimally in polynomial time (see [9, pp. 257-259]), even in the 
weighted case. If we apply this algorithm instead of the greedy algorithm, we 
get an optimal solution for each list Si in the corresponding graph G". So we 
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avoid losing a factor of 2 (Lemma 4) in the approximation ratio and obtain 
approximation ratio 27 instead of 54. This works also in the weighted case. 

Furthermore, our algorithm can be adapted to the bidirected case without 
any difficulties. In fact, the approximation ratio gets better, since the edges of 
a bipartite multigraph can always be colored with 5max colors. Thus, the factor 
in Lemma 3 becomes 6 instead of 9, giving approximation ratio 18 in this case. 

Theorem 3. There is a 27 -approximation algorithm for the weighted version 
of MEDP in undirected complete graphs and an IS -approximation algorithm for 
bidirected complete graphs. 

Finally, we remark that our analysis can be used to show that the natu- 
ral shortest-path-first greedy algorithm also gives approximation ratio 0(1) for 
MEDP in complete graphs. This algorithm works as follows: As long as one of 
the given requests can still be routed without intersecting a previously accepted 
path, select a request that can be routed along a path using the smallest pos- 
sible number of edges and accept it. Repeat this until all remaining requests 
are blocked by the accepted paths. In a complete graph, this algorithm will first 
accept requests that it can route using only one edge, then requests that it can 
route using two edges, and so on. The proof of Lemma 5 shows that there al- 
ways exists a solution S that uses only paths of length at most two and whose 
cardinality is a constant fraction of the cardinality of the optimal solution. We 
can compare the solution computed by the shortest-path-first greedy algorithm 
with this solution S. Each time the algorithm adds a path of length at most two 
to the solution, this blocks at most two paths from S. By the time the algorithm 
starts to consider paths of length longer than two, all paths from S must be 
blocked. Therefore, the solution computed by the algorithm has cardinality at 
least \S\/2. 

4 Minimizing the Load versus the Number of Colors 

The following question is stated as Question 4 in [3] and as Open Problem 3.1 
in [8]. 

Does there always exist, for every set R of requests in a bidirected graph 

G, a routing 4> with L(G, (j>) = L*{G, R) and x(G, 4>) = x(G, R) ? 

We solve this open question by giving a negative answer: we give a list of requests 
i? in a bidirected graph G such that the optimal routing with respect to the 
maximum load differs from the optimal routing with respect to the number of 
colors. The construction makes use of a variant of the brick wall graph, which 
is a graph similar to the two-dimensional mesh, but with internal vertices of 
degree 3 instead of 4. Brick wall graphs have been used in [1] to prove extremal 
results concerning path coloring problems. 

More precisely, the graph G consists of a brick wall part with vertices si, 
S 2 ) • • • ) Sfc on the left boundary (numbered from top to bottom) and with vertices 
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detailed view of one brick 




Fig. 2. The graph consisting of brick wall part and private chains. The canonical paths 
for the short requests are shown only on the private chain from si to t\. The right-hand 
side shows an enlarged view of one “brick” and its short requests. 



ti, t 2 , ■ ■ ■ , tk on the bottom boundary (numbered from left to right), as well as 
separate chains from Sj to U for i = 1, ... ,k, which we call private chains. 
The list R of requests contains the requests (si,ti) for i = 1, ... ,k as well as 
additional short requests that we will specify below. For k = 5, the construction 
is illustrated in Fig. 2, where G is shown as an undirected graph for the sake of 
simplicity. The construction can be generalized to arbitrary values of k. 

The idea of the construction is that the requests (si,ti) can either be routed 
in the brick wall part of G, giving the smallest possible maximum load 2 but 
requiring k colors for path coloring, or along the private chains, giving a higher 
load (because of the short requests that we will add) but requiring the smallest 
possible number of colors. 

For i = 1, . . . , fc, the private chain from Si to ti has the four vertices rn, r^ 2 , 
ri 3 , and rn between Si and ti. We add to R two copies of each of the three short 
requests {rn,ri 2 ), (ri 2 ,ri 3 ), and (ri 3 ,rj 4 ). These are six short requests in total. 
We also add the reverse short requests for these six requests. The short requests 
are illustrated for the private chain from si to ti in Fig. 2. 

Each “edge” between two nodes of degree 3 in the brick wall part of the 
graph is in fact a chain with 4 internal nodes. On each chain corresponding to 
a horizontal “edge” of the brick wall, we add six short requests from right to 
left in the same way as for the private chains. On each chain corresponding to a 
vertical “edge”, we add six short requests directed upward. See the right-hand 
side of Fig. 2. 

In total, R contains the k requests from Si to ti, i = 1,. . . ,k, as well as 
12 short requests on each of the k private chains and 6 short requests on each 
“edge” of the brick wall part. 

We refer to the routing of a short request along the direct edge connecting 
its endpoints as the canonical routing for that request. If three or more of the six 
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short requests with the same direction on a chain, say on the private chain from 
Si to ti, are not routed in the canonical way, the load on the edge {rn^Si) will 
be at least 3. If at most two of the short requests are not routed in the canonical 
way, one of the edges (rjj, must have load 2 already because of the two 

short requests corresponding to the edge. Therefore, we can state the following 
fact. 

Fact 1. Let (p be an arbitrary routing of R in G with maximum load 2. Consider 
a chain from x to y on which six short requests are contained in R in the direction 
from X to y. Then no other request can be routed along this chain in the direction 
from x to y. 

Theorem 4. For the instance constructed as described above, we have: 

(i) For any routing with maximum load L*{G,R) = 2, at least k colors are 

required to color the paths. 

(ii) There exists a routing with maximum load 3 such that the paths can be colored 

with x(G, i?) = 3 colors. 

Proof. First, we convince ourselves that L*{G, R) = 2. Obviously, even the short 
requests on one private chain cannot be routed with maximum load smaller than 
two. Therefore, L*(G,R) > 2. A routing with maximum load 2 is obtained by 
routing all short requests in the canonical way and routing the requests (si,ti) 
in the brick wall part of the graph as sketched in Fig. 2. 

(i) Let a routing with maximum load 2 be given. By Fact 1, no path for a 
request (si,ti) can be routed along a private chain (in either direction), nor can 
it be routed upward or leftward anywhere in the brick wall part of the graph. 
Therefore, the path for (si,ti) must run from Si to ti traveling only to the 
right and downward. Furthermore, any two of the paths for the requests {si,tf), 
i = 1, . . . , fc, must meet at some vertex of the brick wall and, by construction, 
this implies that they intersect in an edge. Therefore, the paths for the k requests 
(si,ti) must receive different colors. 

(ii) The routing is obtained as follows. Route all short requests in the canon- 

ical way, and route the k requests {si,ti) along the private chain from Si to U. 
The load of the resulting routing is 3. The paths for the requests (si,ti) can all 
be assigned the same color, and each path for a short request can be assigned 
one of two other colors in a greedy way. Therefore, three colors suffice. Since no 
routing exists for which two colors suffice (see (i)), this implies x(G, i?) =3. □ 

We remark that the construction can be modified such that no two requests 
in R are between the same pair of nodes. 

Furthermore, note that the construction can be adapted to undirected graphs 
as follows. Take the undirected graph underlying the bidirected graph used 
above. For the list of requests, take the k requests (si,ti) as well as six short 
requests with the same direction on every private chain. The short requests in 
the other direction and in the brick wall part are no longer required, because 
two undirected paths crossing in the brick wall part must always share an edge 
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in the undirected case, since all vertices in the brick wall part have degree at 

most 3. Again, any routing that minimizes the maximum load requires strictly 

more colors than the routing that optimizes the number of colors. 
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Abstract. We study the problem of scheduling n independent general 
multiprocessor tasks on a fixed number of processors, where the 
objective is to compute a non-preemptive schedule minimizing the 
average weighted completion time. For each task, its execution time is 
given as a function of the subset of processors assigned to the task. We 
propose here a polynomial-time approximation scheme for the problem 
that computes a (1 -|- e)-approximate solution in 0(n log n) time for any 
fixed e > 0 accuracy. This provides a generalization and integration of 
some recent polynomial-time approximation schemes for scheduling jobs 
on unrelated machines |l|i^ and multiprocessor tasks on dedicated 
processors | 2 ], respectively, with the average weighted completion time 
objective, since the latter models are included as special cases in our 
problem. 

Keywords: Parallel processing, scheduling, multiprocessor tasks. 



1 Introduction 

In this paper we address multiprocessor scheduling problems, where a set of n 
tasks has to be executed by a set of m processors such that each processor can 
work on at most one task at a time and a task can (or may need to be) processed 
simultaneously by several processors. Here we assume that m is fixed and the 
objective is to minimize the average weighted completion time where 

Cj denotes the completion time of task j. In the dedicated variant of this model, 
denoted by Pm\fixj \ ^ WjCj, each task requires the simultaneous use of a pre- 
specified set of processors. In the parallel variant, denoted by Pm\sizej \ ^ WjCj, 
the multiprocessor architecture is disregarded and for each task there is given a 
prespecified number which indicates that the task can be processed by any sub- 
set of processors of the cardinality equal to this number. In the general model 
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Mehrskalenmethoden” and by EU project APPOL “Approximation and On-line Al- 
gorithms”, IST-1999-14084 
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Pm\setj I ^ WjCj, each task can have a number of alternative modes, where each 
processing mode is specified by a subset of processors and the execution time of 
the task on that particular processor set. 

Previous results: Variants of these problems have been studied, but the pre- 
vious research has mainly focused on the objective of minimizing the makespan 
Cmax = Cj . Regarding the worst-case time complexity, it is known that 

Pb\sizej\Cmax in], P‘^\fiXj\Cmax ^ and P‘i\setj\Cmax |I^ are strongly NP- 
hard. However, there is a polynomial-time approximation scheme (PTAS) for 
Pml/icCjlCniax P, and there is a PTAS for Pm\setj\Cmaii jf OB 7) . 

The first PTAS for a strongly NP-hard scheduling problem minimizing the 
average weighted completion time was given for scheduling jobs on identical par- 
allel machines P|| ^ WjCj jEj- Then recently it was proved in [I] that there are 
PTASs for many different variants of classical scheduling problems. These re- 
sults include scheduling on unrelated parallel machines Rm\ \ 0- 

multiprocessor setting, in contrast to the makespan objective, only few approxi- 
mation results are known. Furthermore, they concern only minimizing the sum of 
completion times: There are a 2-approximation algorithm for P2\fixj\ '^Cj jS|, 
a 32-approximation algorithm for P\sizej \ ^ Cj j2!I|, and - as it was shown re- 
cently - a PTAS for Pm\fiXj \ ^ Cj [2|. 

New results: The problem of scheduling general multiprocessor tasks Pm\setj\ 
'^WjCj can be also viewed as a generalization of two well (but mainly inde- 
pendently) studied scheduling problems: scheduling tasks on unrelated parallel 
machines i?m|| ^ WjCj and multiprocessor task scheduling with dedicated pro- 
cessors Pm\fixj \ ^ Wj Cj. In the case of unrelated machines, for each task (job) 
there are m processing modes, each with a single processor (machine). In the 
case of dedicated processor sets, each task has only a single processing mode but 
including (typically) several processors. Since both of the above special cases are 
strongly NP-hard m for general weights and m > 2, even if there are only a 
constant number of processors, it is natural to study how closely the optimum 
can be approximated by efficient algorithms. 

In this paper, focusing on the case where m is fixed, we integrate many of 
the above mentioned recent results that have shown the existence of PTASs for 
the two special cases, by providing the following generalization: 

Theorem 1. There is a PTAS for Pm\setj\'^WjCj that computes, for any 
fixed m and e > 0 accuracy, a (1 -|- e) -approximate solution in 0(n log n) time. 

The actual running time of the proposed algorithm depends exponentially on 
both m and 1/e. However, the above result is, in some sense, the strongest 
possible one someone can expect. First, it shows the existence of a PTAS for a 
problem with fixed parameter m that cannot have a fully PTAS m Second, 
following the ideas in m and by using the results mm one can prove that 
both P\setj,pj = l\Cmax and P\setj,pj = cannot be approximated 

within a factor of m^~‘^ , neither for some e > 0, unless P=NP; nor for any 
e > 0, unless NP=ZPP. Hence, not only the above results cannot be extended 
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or generalized for the general variant where m is not fixed, but even substantially 
weaker approximation results cannot be expected. 

Our approach: In this paper we employ a well known idea of transformations 
- simplify instances and schedules with some loss in the objective. We refine 
some recent approximation techniques developed in mm- In order to be able 
to cope with general multiprocessor tasks, we make adjustments of processing 
times defining the task profiles and classifying the tasks as huge and tiny. This 
requires the creation of a sequence of gaps in the schedule, where all processors 
are idle for a certain period of time. In our approach, we also apply some recent 
makespan minimization tools from HD to schedule tasks within single intervals. 
For handling tiny tasks, we use a linear programming formulation along with 
some rounding, and observe that in some near optimal schedule tiny tasks of 
the same profile can be scheduled by Smith’s rule uni- Then, by using the task 
delaying technique presented in we introduce special compact instances in 
which there is only a constant number of tasks can be potentially scheduled in 
an interval. The final idea is to use dynamic programming which integrates all 
the previous components. The obtained PTAS is a combination of the instance 
transformations and the dynamic programming algorithm. 

The paper is organized as follows: In Section 2, we give the definitions and 
discribe some techniques. In Section 3, we consider the subproblem of scheduling 
in one subinterval, adopt the PTAS for Pm\setj\Ciaayi ini> and and discuss some 
consequences. In Section 4, we present a dynamic programming framework which 
can be used to integrate all the previous components. 

Remarks: This work was motivated by P, where it was also announced that 
there is a PTAS for scheduling on unrelated parallel machines with release dates 
Rm\rj \ WjCj (to our best knowledge, the full proof of this result has not yet 
appeared in literature), and our very recent work j 1 ,4) . where we have shown the 
existence of a PTAS for Pm\fixj,rj\ Y WjCj. Our original goal was to provide 
a generalization for all previous results on scheduling problems involving a fixed 
number of processors (machines), release dates and the average weighted com- 
pletion time objective. In this paper we don’t achieve this goal completely, but 
provide hopefully a major step towards it by presenting a few novel ideas and 
techniques that might be also of interest for people working on approximation 
algorithms for other scheduling problems. 

2 Preliminaries 

Formally, for each instance / of our problem we are given sets T = 
{0, 1, . . . , n — 1} of n tasks and M = {1, . . . , m} of m processors. (Let 2^ denote 
the set of all subsets of M.) Each task j has a positive weight Wj and an associ- 
ated function p.j : 2^ — ^ IR+ U {+oo} that gives the execution time p^j of task 
j in terms of the set of processors p. C M which is assigned to j. Given the set 
fJ-ij) ^ M allotted to task j, the processors of fi{j) are required to execute task j 
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in union and without preemption, i.e. they all have to start processing of task j 
at some starting time Sj, and finish at the completion time Cj := Sj +Pfj,(j)j- A 
feasible schedule cr consists of an allotment /i(j) and a starting time Sj for each 
task j G T such that no processor executes more than one task at each time 
step. Then, the value Cw{cr) := WjCj(o') is called the average weighted 

completion time of cr. Here, the number of processors m is considered to be a 
fixed constant, and the objective is to find a feasible schedule that minimizes 
the average weighted completion time. 

Further, we write cr(/) to denote a schedule a with respect to an instance I. 
Accordingly, a schedule aopt{I) is called optimal if Cu]{<Jopt{I)) = OPT{I), where 
OPT{I) is the minimal average weighted completion time for I. Given a schedule 
cr, a schedule a' is called an e-schedule of a if Cwicr') /Cw{<j) < 1 + Ke, where K 
is some constant. We say that one can transform with 1 + 0(e) loss the objective 
and in polynomial time an instance I into instance I if the following holds: I is 
obtained form I in p(|/|) elementary operations for some polynomial p(-), any 
feasible schedule of I is also feasible of / and, OPT{I) /OPT{I) < l-\-Ke, where 
K is some constant. 

We will show below that with 1 + 0{e) loss, any instance I of our problem 
can be transformed in 0(n log n) time into a special compact instance I such 
that one can find in 0{n) time an e-schedule of (Topt{I)- Clearly, this suffices to 
obtain a PTAS for our original problem. 

2.1 Basic Techniques 

For any fixed accuracy e > 0, we assume w.l.o.g. that logi_|_£(l + ^) and ^ are 
integral. We partition the time interval (0, oo) into disjoint intervals Ix of the 
form [RxtRx+i)j where Rx = (1 + s)^ and x G 2Z. Notation Ix will also be 
used to refer to the length eRx of the interval, thus Ix = Rx+i ~ Rx = sRx and 
Ix+i = (1 -I- e)Ix- For a schedule a, let y{j) < z(j) be those indices for which 
Sj G Jy(j) and Cj G Iz{j): respectively. 

In this paper, for a given schedule a, we always use the constructive method 
to find e-schedules, i.e. we show directly how tasks can be rescheduled in cr 
without dramatically increasing the value of the objective function. Next, we 
describe some techniques that will be applied later. Knowing a , we construct a 
new schedule a' as follows: 

Stretching: Set = fJ,{j) and Sj = (1 -k e)Cj — Pp(j)j for each task j G T. 
This generates £Pp,(j)j idle time on /x(j). 

Rearranging: Set yi'{j) and Sj for each task j G T such that a new schedule 
a' is feasible and z'{j) = z(j), i.e. tasks are rescheduled preserving C' G Iz{j)- 
This gives us the ability to rearrange tasks inside intervals. Note that at any 
time then we use this technique we have to specify the way of rearranging. 
Shifting*: Set ^'(j) = fi{j) and Sj = Sj — Ry(j) + Ry[j)+\ for each task j G T, 
i.e. in a' we have y'{j) = y{j) + 1, and hence the distance between Sj and the 
beginning of interval Iy(j) is preserved. This generates e{R 2 (j) — Ry(^j)) additional 
idle time on pf (j) after the completion time Cj. 

Shifting**: Set p'{j) = p(j) and Sj = Sj — Rz{j) + Rz(j)+i for each task j G T, 
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i.e. z'{j) = z{j) + 1, and hence the distance between Cj and the end of interval 
is preserved. This generates s{Rz(j) — Ry{j)) additional idle time on /i(j) 
before the start time S''. 

Proposition 1. If a schedule a' is obtained from a schedule a by stretching, 
rearranging, or both types of shifting then a' is an e-schedule of u. 

The following lemma that will be used throughout the paper. 

Lemma 1. For any schedule a there is an e-schedule a' of a such that the 
following holds in a' : each task j starts not earlier than each task j 

crosses at most a constant number s*(e) := log2^^g(l+ of intervals; each task 
j with Pfj,'{j)j < non-crossing; each crossing task j starts at one of 

points Ry'(j) + Iy'U)i where i G {0, 1, . . . , ^ — 1}. 



2.2 Blocks, Gaps, and Profiles 

A block Bi is a set . . . , /f,/(j)} of consecutive intervals. A block structure 

61,62, - ■ ■ is a sequence of blocks such that any block 6i is of the length Si := 
b'{i) — b{i) + 1 which is is at least 2s* (ff) and at most 2S{s), any pair 6i, 6i+i of 
consecutive blocks shares exactly one interval Ib'(i) = Ib(i) (i-e. b'{i) = b{i + 1)) 
and the total length of these two blocks > i5(e), where J(£) := 

For a schedule cr, a gap is an interval in time where all the processors of M are 
idle in cr. The following Lemma shows that with any schedule one can associate 
a block structure such that there is a gap in the beginning and the end of each 
block. 

Lemma 2. For a schedule a there is an e-schedule cr' of a such that there exists 
a block structure 61 , 62 , - ■ ■ for the schedule a' with the property that for each 
dynamic block Bi there is a gap in the first interval Ib(i) - This gap starts at one 
of the points Rb(i) + ie^Ib(i), ^ = 1 • . • , ^ — 1, and has a length which is at least 

For an instance I and task j G "T, let := min^g2“ Py-j be the minimal 
processing time needed to execute task j. The value is called the length of 
j- Then, the following holds. 

Lemma 3. With 1 + 0{e) loss one ean transform in 0{n) time an instance I 
into instance I such that for each task j G T and pL G 2^ the following holds 
in I: the processing time Py_j = (1 + where u)y,j G IN U + 00 ; if Pyj yf 00 

then Pftj < h(rn,e)pf^™, where h{m,e) is a constant that depends on m and e. 
Furthermore, in I the quotients pf^™/wj are different for all tasks in T- 

For simplicity, we will use the following notations throughout the paper. By 
using the above Lemma, for a task j G T, the corresponding 2™-tuple u{j) =< 
ujyj is called the execution profile of j. Accordingly, the set of all distinct 
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possible execution profiles is denoted by 17 := {uj{j) \ j G T}. Then, it is not 
hard to prove that the number of profiles in 17 is bounded by 

i^{m,e) := [2 + logi+, /i(m, e)] 2”“ . 

In addition, for a given instance I, we introduce an index x(j) for each task 
j € 1~ that corresponds to the interval Ix{j) earlier which processing of j can not 
be started. (By using Lemma Q we assume that U) < < Rx{j)+i for 

each task j G T. However, we revise this property in Lemma El) Accordingly, 
a task j is called huge if p™'" > e^Ix(j)/q*, and tiny if < e'^Ix(j)/q* , where 
the parameter q* = q*{m,e) » 2™h(m,e) is specified later in Section ^3 To 
distinguish sets of huge and tiny tasks we write HT and TT, respectively. 

For a task set C T, we will use := denote the total 

length of the tasks in X, use to denote the tasks of X of profile w G 17, 
and Xx to denote the set of tasks j G X with x{j) = x. To indicate that X is 
associated with particular instance I, we will write X(I). Finally, regarding the 
objective function we use the following. 

Proposition 2. For any instance I , with at most 1 + e loss one can consider 
the objective function '^WjRz{j) instead of the original function 

3 Scheduling Inside Interval 

In the first part of this section, we consider the problem of scheduling in a single 
interval Ix- More precisely, we present an algorithm that schedules non-crossing 
tasks with respect to a known schedule of crossing tasks. In order to achieve 
this, we generalize the PTAS for Pm\setj\Craa.^ im Basing on some features 
of the algorithm, we define the value of parameter q* = q*{m,e) for tiny tasks, 
and consider the algorithm as a subroutine of the rearranging technique. After 
that, we show that tiny tasks are all small corresponding to intervals and can 
be scheduled by Smith’s rule m- In the last part of this section, we show how 
an instance of our problem can be transformed into a compact instance in which 
there is only a a constant number of tasks that can be potentially scheduled in 
an interval. 



3.1 Long and Short Tasks, Relative Schedules and LP Formulation 

Let a{I) be a feasible schedule for instance I. For an interval Ix of cr, the following 
sets of tasks (running in Ix) are defined: set Wx of non-crossing tasks and set 
/Cx = Kx ^ Kf. U Kx of crossing tasks, where K], is the set of incoming crossing 
tasks (that complete in Ix), Kf is the set of outgoing tasks (that start in Ix), 
and Kx is the set of throughgoing tasks (that go through Ix). Suppose that we 
have applied shifting** to cr{I) and obtained <j'{I). W.l.o.g. assume that in the 
schedule cr'{I) the tasks of and Kx run in interval Ix+i- Then, there is at 
least elx idle time on the processors, between the tasks of Ydx and the outgoing 
tasks of Kx or between the tasks of Wx and the end of interval Rx +2 ■ 
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Assume for simplicity that Wx = {0, . . . , n'} with > ■ • ■ > P™/” 

and = 0. Then, it holds that p'^^iVVx) < mix- Furthermore, we fix allotment 
/i(j) for each crossing task j G Kx and define Pfj.(j)j '■= Cj — Rx for each task 
j G Kl and p^(j)j '■= Rx+i — Sj for each task j G K^. 

Partition Wx into two subsets Cx = {0 , . ■ . , kx — 1} and Sx = Wx \ Hx-, where 
the appropriate value for kx = kx{m,e) ^ 1 will be specified later in Lemma El 
Tasks in Cx and Sx are called long and short, respectively. Let J'x = Cx U ICx be 
the set of long and crossing tasks. 

A processor assignment of J~x is a mapping px ■ Jx — t 2^ such that Pxij) = 
plyf) for each task j G U K^. Two tasks Tk and are called eompatihle, if 
Px(k) n px{h!) = 0. A snapshot of Jx is a set of compatible tasks of Jx- A 
relative schedule of Jx is a pair TZx of a processor assignment px of Jx and a 
sequence of snapshots Mi , . . . ,Mg of Jx with respect to px, such that each 
task j G Jx occurs in a subsequence of consecutive snapshots Muj, - - - ,Mvj, 
1 < Uj < Vj < g, where any two consecutive snapshots Me and Me+i are 
different and it holds that Uj = 1 if j G K^, Vj = g if j G (see Figure [Q. 
Then, it is not hard to see that the number of snapshots g is bounded by 2{kx+m) 
and the number of different (appropriate) relative schedules of Jx is bounded by 
a constant depending only on m and the number kx of long tasks in Cx- 




Fig. 1. A relative schedule TLx 



Given a set /i C M, a p-configuration Cg is a partition of p into non-empty 
sets. Let be the total number of /^-configurations and let Ci_^, . . . , be 

all /(-configurations. Note that Nm = B{m), where B{m) < ml is the mth Bell 
number. 

Consider a relative schedule TZx, where the processor assignment px such 
that no task j in Jx has the assigned processing time Pg^Oj ~ Then, let 
Fe '-= M \ UjGAif Ma:(j) denotes the set of free processors in snapshot Me, 
f<f-<g- For a (z(to, e)-vector D := a relative schedule TZx is called 

feasible for D if for any J 0 there exists a snapshot Me in TZx such that 
pQ Fe and yf -l-oo. 

Now assume there is a set of tasks ACT such that p™'"(A“) = for 
each profile w G 12. (Recall that if a task j G A“ is assigned to a processor 
set p, then the processing time of j on p is equal to p™“(l -I- e)“'‘. Thus, if all 
tasks of A“ are assigned to p, the total processing time needed to execute them 
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is equal to _D“(1 + Then, in order to find a preemptive schedule of X 

with respect to the relative schedule Ti-x, one can formulate the following linear 
program LP{TZx, D) in terms of variables associated with T^-configurations: 



Minimize 


1 

O 








s.t. (0) 


tg — Rx+l, 








(1) 


ti > tf_i, £=!,..., 


9 , 
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j e Jx, 
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^ ^ ■ ■ ■ 5 
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where the variables have the following interpretation: 

ti'. the time when snapshot A4g ends and Xii+i starts. The starting time of the 
schedule is denoted by tg = Px+i and the finishing time by tg (see Figure 
Xi^Ft- the length of the configuration Ci^Pi in snapshot Aig. During an interval 
of length Xi^Ff the tasks of X can be executed on processor subsets /i G 
yg^uj- the assignment variable indicating which part of the tasks of is assigned 
to be executed on a processor set fi G 2^, reflecting that the total processing 
time needed to execute such tasks of X‘^ is equal to ZJ“(1 + 

D(/r): the total processing time needed to execute the tasks of X assigned to a 
processor set y G 2^. 

Less formally, the constraints (0)-(2) define a schedule structure with respects 
to the relative schedule TZx, the constraint (3) defines a substructure of config- 
urations inside the snapshots of TZx, the constraints (4)-(5) define the balance 
between the free time available in the configurations and the allotment of work 
that have to be executed in order to complete all tasks. (One can also associate 
each set X‘^ with one task of the length and think in terms of preemptive 
scheduling of these v{m, e) tasks of different profiles such that execution of parts 
of the same task on different sets in parallel is allowed.) 

Let (t*,x*,y*) be an optimum solution of LP{TZx, D). Then, the following 
holds P^ . 

Lemma 4. For any two non-negative v{m, e)-veetors a and b sueh that a relative 
sehedule TZx is feasible for a and a b, the following inequality holds 



t*{LP{TZx,a)) < t*{LP{TZx,a+b)) < ^*(LP(7^„ a)) + 



max (!+£)“" 






Lemma 5. If a relative sehedule TZx is feasible for the F{m,e)-veetor D = 
then the optimal value t* — tg of LP{TZx,D) is not larger than 
the makespan of any sehedule of Jx U Sx that respects the relative schedule TZx ■ 
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3.2 Generating a Schedule, Tiny versus Short 

Let be an optimum solution of LPiTZ^, D). We construct a schedule 

for the tasks in and Sx as follows. For each profile to and > 0 we select 
the tasks of Sx in a greedy manner until the total length does not exceed y* > 0 
(if the total length plus the length of the last selected task exceeds y*^^ > 0 we 
mark this task as unbalanced, there can be at most 2"* such ones for one profile) 
and assign them to be executed on the subset fi. After that, starting from the 
first snapshot, by using the values x* > 0 and accordingly with the subsets 
y. G Ci^Ft ^ we select the tasks that are assigned to /r in a greedy manner until 
their total processing time does not exceed x* p^ > 0. If it happens that for a 
configuration the selected tasks do not fit inside the interval of the length 
X* > 0, we increase x* p^ > 0 by a small amount and accordingly, we also 
adjust the value by the total increase epi (see Figure I2). 




Fig. 2. Generating a schedule 



Since there are at most z/(m, e) profiles the total number of unbalanced tasks 
is at most v{m, e)2™. Furthermore, since the number of snapshots g < 2{kx+rn) 
and the number of configurations is bounded by the mth Bell number B(m), the 
total enlargement Stotai is at most 



+P™1 + . . •P/“+2(/c*+m)B(m)+i/(m,e)2™-l)i 



i.e. the maximum processing tasks of the kx+2{kx + rn)B{m) + i/{m, £t) 2'" longest 
tasks of Sx- The following lemma is a useful tool to select kx such that e total is 
kept small enough HH: 



Lemma 6. For any interval Ix there is a number kx with 1 < fca; < (q + 

T mh(m,e) o 

+ d[l + (g + 1) + . . . + (g + 1)^^ 



1 ) 



nh(m,e) 




such that h{m,£){p 



,min 



. . . < s^Ix, where q := 3mB(m) and d := v{m,e)2^‘ 



The final algorithm works as follows: select an appropriate kx, enumerate 
all relative schedules together with solving LPs and deducing the corresponding 
schedules, output the schedule with the smallest makespan. 



Now, we define q*{m,e) := a{m,e) j^^m +2 _|_ ^2 ^ gy 

Lemma n we may assume that tiny tasks are non-crossing in cr(/). Accordingly, 



n.h(m,e) 
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let C >Vx be the set of tiny tasks that are non-crossing in Then, the 
following holds: 

Lemma 7. The number kx of long tasks in Cx is not larger than the number of 

2 r 

non-crossing huge tasks in Ix with pj > Hence, all tiny tasks are short, i.e. 
yxQSx. 

Lemma 8. Let yx be another set of tiny tasks such that for each profile 
a; G 17, = 0 implies = 0, and yf 0 implies either 

Then algorithm outputs a 

feasible schedule ofWx '■= (Wa , inside Ix+i with respect to the schedule 
of ICx in cr'il) ■ 

Proof. The proof follows from Lemmas 033 and the fact that the interval Ix+i 
contains at least elx > e'^Ix + h{m, e) J2ujeO 2 h{m%v{m e) time. (Recall that 

m.a.Xf_,^oG,c.jenP(^j < Pr’^Hm,e).) □ 

3.3 Scheduling Tiny Tasks and Compact Instances 

Now consider the problem of placing tiny tasks in the schedule. In the following 
we restrict ourselves to the case of scheduling tiny tasks of the same profile, say 

min „min 

to. We say that two tiny tasks k,£ € TT“ with — — < — — are scheduled by 
Smith’s rule in a schedule a if it holds either z{k) < z{£), or z(£) < x{k). In 
other words, if the two tasks are available at the same interval (that means £ is 
not completed before the earliest interval where k can be scheduled), then the 
task k of smaller value Pk/wk is scheduled first with respect to intervals. Note 
that the tasks of TTf. have to be scheduled in the order of increasing values 

pimn/w- 

Lemma 9. For a feasible schedule cr(J) there is an e -schedule cr'{I) ofa{I) such 
that for any profile lo £ £2, the tiny tasks ofTT'^ are scheduled by Smith’s rule 
in cr'(/). 

Proof. W.l.o.g. let Ji, II be the first and the last interval in a feasible schedule 
a. By Lemma[3 assume that all tiny tasks are non-crossing in cr and by LemmaEl 
all quotients Pjfwj are different. Let yf be the set of tiny tasks of profile oj that 
are scheduled in interval Ix and let := p™””{yf) be the total length of these 
tasks in Ix. Consider the following LP{uj): 

Minimize ■. ierr- T.x=x(j) 
s-t. (1) = 1, VjGTT^, 

( 2 ) 

(3) yj,x > 0, Vj G TT“, X = x{j), ...,L. 

First, one can find a feasible solution of LP{uj) by assigning the values y with 
respect to the schedule cr, i.e. for each task j G TT^ set y,^x = 1 if z{j) = x and 
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yj,x = 0 otherwise. Hence, the objective value of the linear program LP(uj) is 
not larger than the weighted average completion time for the tiny tasks of 
in CT. In other words, the value of an optimal fractional solution is a lower bound 
of the weighted completion time Consider an optimal solution 

(yj,x) of linear program. Suppose that there are two tasks i and k that are 
not scheduled by Smith’s rule. W.l.o.g. we can consider the case when yj > 0, 

y*k,x, > < x{£) <xe<Xkiind^ <^. 

Then, there exist values zg and Zk such that 0 < zg < and d < Zk < y% 
and zgpY™ = Zkp'k™- Now we exchange parts of the variables: 

y'e,x, = y*e,xt - V'i,x^ = V\,x^ + 

V'k,x^ = y*k,x^ - Zk y'k,xi = Vk,xt + Zk 

and y'j ^ ~ yj x fo'' remaining variables. 

The new solution (j/ya,) is feasible and the objective value 

J2x=xU) y'j,xRx is equal to Y.jeTT- ELx(j) VpxRx + Rt,k, where Rg^k = 
{R^^ - R^^){wkZk - zgwg). By Zk = zg^^, the second factor {wkZk - zgwg) = 

zg[Wk^^ — wg). Since and zg > 0, the second factor is larger than 

0. The inequality xg < Xk implies Rx^ < Rxk Rg^k < 0. In other words, the 
new solution (j/' has a lower objective value and gives us a contradiction. This 
means that the two tasks £ and k are scheduled by Smith’s rule. 

Now we use some properties of the above linear program. There is an optimal 
solution such that for each interval Ix we have at most one task jx G 7”7”“ with 
Xj^x G (0,1) and that is assigned for the first time. Otherwise we can use the 
same argument as above (and the fact that the quotients p^^’^/wj are different) 
to improve the objective value. Furthermore, such an optimal solution can be 
obtained by applying Smith’s rule in a greedy manner. To turn such a fractional 
solution into an integral assignment of tiny tasks to intervals, we need only to 
increase the values = p™‘"(3^^) by Then, jx fits completely into Ix- 
Let yj! be the set of tiny tasks of profile uj that is assigned to an interval Ix 
with respect to the integral assignment of LP{uj) (including the task jx). For 
each interval Ix of schedule tr we consider the set yx = of all tiny tasks 

that corresponds to LP{u>), w G 17. It is not hard to check that the conditions 
of Lemma IHI hold. Thus, we apply the shifting* to cr, replace the sets yx by ^x 
and apply the rearranging technique to each interval Ix+i - The final schedule is 
constructed. 

To finish the proof, there are two facts. First, the total weighted average 
completion time of the schedule of tiny tasks which is given by the optimal frac- 
tional solutions of LP{u>), a; G 17 is less than their corresponding total weighted 
average completion time in cr. Second, we have used only the shifting and rear- 
ranging techniques. □ 

Furthermore, we can prove the following result: 
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Lemma 10. With 1 + 0(e) loss and in 0(n log n) time, one ean transform an 
instance I into instance I such that for each profile uj G f2 the following holds: 
\'T{I)x\ < N*{m,e) and p™™(T{I)x) < ^ Ix) where N*{m,e) is a constant 
that depends on m and e. 



4 The Dynamic Programming Algorithm 

As we mentioned it above, we will use a dynamic program to compute approxi- 
mate solutions for our scheduling problem. To be able to formulate this dynamic 
program and show that the algorithm solving it has the desired running time, 
we need only the following result: 

Lemma 11. For a feasible schedule <j{I) there is an e-schedule u'{I) such that 
in the tasks in T{I)x are scheduled within a constant number d*{m,e) of 

intervals that follow I^, i-e. for each task j G T it holds that z'{j) — x{j) < 
d*{m,e). 

The last step is to use dynamic programming with blocks as units. By using 
Lemmas ITTTIini there is at most a constant number of tasks in 7^ for each 
interval Ix, all of these tasks have to be scheduled within a constant number of 
intervals that follow Ix, hence one can efficiently enumerate all possible schedules 
of the tasks of Tx- Here (due to space limitation) we omit the formal description, 
leaving it to the full version of the paper. 
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Abstract. The time-dependent orienteering problem is dual to the time- 
dependent traveling salesman problem. It consists in visiting a maximum 
number of sites within a given deadline. The traveling time between two 
sites is in general dependent on the starting time. 

We provide a (2 -I- e)-approximation algorithm for the time-dependent 
orienteering problem which runs in polynomial time if the ratio between 
the maximum and minimum traveling time between any two sites is 
constant. No prior upper approximation bounds were known for this 
time-dependent problem. 



1 Introduction 

In the well known orienteering problem (see, e.g. |1I9| ) a traveler wishes to 
visit the maximum number of sites (nodes) subject to given restrictions on the 
length of the tour. This problem can be regarded as the problem of traveling 
salesperson with restricted amount of resources (time, gasoline etc.) wishing to 
maximize the number of visited sites. For this reason, the orienteering problem 
has been also called “the generalized traveling salesperson problem” or even 
originally as “the bank robber problem” PE). Even if the ratio between the 
maximum and minimum amount of resources required for traveling between two 
sites is constantly bounded, the orienteering problem is MAX-SNP-hard simply 
because the correspondingly restricted traveling salesman problem is MAX-SNP- 
hard PH] (cf. mm)- 

In this paper we consider a generalization of the orienteering problem which 
we term time- dependent orienteering (TDO for short). In our generalization, the 
cost of traveling (time cost in our terminology) from any site to any other site 
in general depends on the start moment. 

* The work of the first author was done while he was at the Centro de Modelamiento 
Matematico, Universidad de Chile and UMR 2071-CNRS, supported by FONDAP 
and while he was a visiting postdoc at DIMATIA-ITI partially supported by GACR 
201/99/0242 and by the Ministry of Education of the Czech Republic as project 
LN00A056. Also this work was supported by the RFBR grant NOl-01-00235. 

R. Freivalds (Ed.): FCT 2001, LNCS 2138, pp. 508-|^| 2001. 
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The orienteering problems considered in ^ are classified as the 
path- orienteering, cycle- orienteering, and even tree- orienteering problems de- 
pending on whether the network to be induced by the set of pairs of consecutive 
sites visited is supposed to have a form of a path, a cycle, or even a tree, respec- 
tively. Additionally, one can wish to have one or more roots, which are ‘essential’ 
sites required to be visited. Following this classification we will refer to the cases 
of TDO problems without roots as path (cycle, tree)-TDO and to the cases with 
roots as rooted-path (cycle, tree)-TDO. 

For illustration, consider the two following examples of possible applications 
of TDO. 

Kinetic TSP j5l lUJ . There is given a set of targets and one robot (interceptor) 
with restricted amount of resources (e.g., fuel). The dynamic of targets is known, 
i.e., for each target one can specify its location at any discrete time moment. 
The problem is to find a program for the robot which allows it to hit as many 
targets as possible within a given time. This problem is an example of path- 
TDO. If there is a target specified to be intercepted then we have an example of 
rooted-path-TD O . 

Time-dependent maximum scheduling problem (TDMS). There is given 
a set of tasks for a single machine. The execution of any task can start at any 
discrete moment and its execution time depends on the starting moment. The 
problem is to find a schedule for the machine maximizing the number of tasks 
completed within a given time period. It can be interpreted as a special case of 
TDO where the time of traveling from a site a to a site b does not depend on 
the site b and is interpreted as the execution time of the task corresponding to 
a. The Web Searching problem studied by Czumaj et al. [2] yields the following 
appealing motivation for TDMS. Assume that there is a central computer which 
is being used to collect all the information stored in a number of web documents, 
located in various sites. The information is gathered by scheduling a number of 
consecutive client/server connections with the required web sites, to collect the 
information page by page. The loading time of any particular page from any site 
can vary at different times, e.g., the access to the page is much slower in peak 
hours than in off-peak hours. We wish to download the maximum number of 
pages within a given period of time. 



1.1 Main Results 

An algorithm is said to be c- approximation algorithm for a maximization prob- 
lem P if for any instance of P it yields a solution whose value is at least 1/c 
times the optimum. 

Let n be the number of input sites and let k be the ratio between the maxi- 
mum and minimum time required for traveling between two sites. 

We present (2-|-e)-approximation algorithms for path-TDOs and cycle-TDOs 
running in time 0((2A;^(^^))!^n^^ In the corresponding rooted cases 

the time complexity increases by the multiplicative factor O(^). These bounds 
immediately carry over to the corresponding time-independent special cases. 
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i.e., the unrooted and rooted, path-orienteering problems and cycle-orienteering 
problems. Our algorithm is the first constant-factor approximation algorithm 
for TDO with k = 0(1) running in polynomial time. Although for large k, our 
algorithm can be hardly claimed to be practical because of its fairly high running 
time, it suggests that practical and efficient algorithms might be possible. 

1.2 Related Results 

The authors are not familiar with any explicit prior approximation algorithms for 
time-dependent orienteering (TDO). Of course, if the ratio between the maxi- 
mum and minimum distance is k then any approximation algorithm is a k- 
approximation one. 

For the time-dependent maximum scheduling problem (TDMS) which can be 
interpreted as a special case of TDO, a simple greedy 2-approximation algorithm 
running in time 0(mt), where m is the number of available tasks and t is the 
deadline, can be essentially deduced from Spieksma’s algorithm foi' Job 
Interval Selection Problem. Also, it follows from the same work that TDMS 
is MAX-SNP-hard. 

As for the classical, i.e., time-independent, orienteering problem, Awerbuch 
et al. proved that a c-approximation algorithm to the so called k-traveling sales- 
person problem, asking for a shortest cycle visiting k sites (k-TSP), yields a 2c- 
approximation algorithm for the orienteering problem |2| . This result combined 
with known approximation results for k-TSP yields a 6-approximation algorithm 
for the orienteering problem in metric spaces and a (2 -|- e) -approximation al- 
gorithm in the Euclidean plane. The latter result has been subsumed by Arkin 
et al. who presented 2-approximation algorithms for several variants of the ori- 
enteering problem in the plane [IJ. More recently, Broden has designed an 4/3- 
approximation algorithm for the very special case of the orienteering problem 
where the pairwise distances are constrained to {1,2} jS|. Note here that the 
recent lower bounds on the constant approximation factor for the analogously 
restricted traveling salesperson problem ^ easily carry over to the aforemen- 
tioned special case of the orienteering problem. 

For an interesting review of results related to the orienteering problem, in- 
cluding several variants of the traveling salesperson problem, the reader is re- 
ferred to p. 

The recent works by Hammar et al. IH3 and Broden |Hj contain a number 
of inapproximability and approximability results on various restrictions of the 
problem dual to TDO, i.e., the time-dependent traveling salesperson problem. 

2 Formal Definition of TDO 

For a given set S of sites, a time-travel function I : S x S xNU {0} — >■ R+ and a 
deadline t, the salesperson’s tour visiting a subset T of m sites is a sequence of 
triples 

(si, t/" , ), (s 2 ; ^2 )i • ■ • I (®fcj 
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such that 

1. For i e {1, 2, . . . ,m}, tf,t~ e NU {0}; 

2. Ur=iM = r; 

3. 0 = 4 <t^ <4 

4. For each z e {1,2, ... ,m - 1}, 4+i ~ ^7 = Si+i,t~). 

It is useful to interpret the moment t~ as the moment when salesperson leaves 
the site Si and 4 ^ the moment when salesperson enter Si. So 4+i ~ the 
time spend in travel from Si to s^+i and t~ ~4 is the time the salesperson stays 
in Si (importantly, the traveler is allowed to stay at any site any time). The 
path (or cycle) time-dependent orienteering problem is to find an open (closed, 
respectively) tour visiting maximum number of sites within the time t. 

Note that the classical orienteering problem PJ is a special case of TDO 
where for any sites o, b, the travel time from a to 6 is time-independent, i.e., 
l{sa,Sb,t') = l{sa,Sb,t") for any e [0,t]. 

3 Main Procedure and Algorithms 

We may assume w.l.o.g that the minimum travel time between two sites, 
ming s/g 5 _(/g[o_t] Z(s, s', t'), is 1. For a nonnegative integer q and positive inte- 
ger m < \t/q~\, we shall denote by Sm{<l) the subinterval [q{m — 1), min{gm, t}] 
of [0, t]. 

The following simple procedure is the heart of our algorithms for TDO. 
Procedure Greedy{S, q,t) 

1. S] 

2. for z = 1, 2, . . . , [f/q] do 

let Ti be a maximum cardinality subset of T that can be visited within 
Si{q) by a path-tour; 

visit Ti within Si{q); 

T -(^T\Ti 

Note that procedure Greedy{S,q,t) outputs for each z S (1,2, . . . , [t/g]} 
a salesperson’s path-tour for interval Si{q) but in general the output T of 
Greedy{S,q,t) is not a tour. Obviously, the set Ti can be found in 0(g!|S'|'^) 
time by considering all possible choices of a subset of T containing at most q 
elements and then applying the straightforward 0(n!)-time brute force method 
for the time-dependent traveling salesperson problem on the subset. Hence the 
overall time complexity of Greedy{S, q,t) is 0{q\\t/q'\ |S'|‘'). 

Let k be the maximum time needed to travel between two sites, i.e., 
k = rnaxg g/g 5 Z(s, s',t'). Note that by our initial assumption k is also the 

ratio between the maximum and minimum time required for traveling between 
two sites. 
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Algorithm Greedy Path{S, q, t) 

1. Run procedure Greedy{S,q,t)] 

2. for i= 1, . . . , \t/q'\ do 

Remove from the set of visited sites all sites visited during [qi—k/2, qi+k/2\, 
Glue the obtained subtours by forcing the salesperson to go from the last 
visited site in [q{i — 1) + fc/2, qi — k/2] to the first visited site in [g(i) + 
k/2, q{i + 1) — k/2] 

Algorithm GreedyGycle{S,q,t) is obtained from GreedyPath{S,q,t) by closing 
the path-route near its endpoints, i.e., by forcing the salesperson to go from the 
last visited site in s\^t/q\-i to the first visited site from [k,q — k/2]. 

4 Approximation Analysis 

Lemma 1. Let n be the maximum number of different sites that can be visited 
by \t/q] travelers in such a way that for each i € {1,2, . . . ,\t/ q]} , the i-th 
traveler operates only within the time interval Si{q). Then the number of sites 
participating in subtours produced by Greedy{S,q,f) is at least |"n/2]. 

Proof. Let W be the set of sites returned by Greedy{S, q, f), and let FF be a set 
of n sites that can be visited by the \t/q] travelers. We denote by Wi the set of 
sites visited by Greedy{S,q,t) during [0,fg] and by Wi the set 

{s G W \ Wi and s is not visited during [0,qi] by the \t/q] travellers}. 

In other words, Wi consists of the sites in W that have not been visited either 
by Greedy{S,q,t) or by the first i of the \t/q] travelers during [0, gf]. 

We claim that for each i & [1,2, ... ,\t/q]} 

\Wi]>n-2]W\ 

Since W^t/q\ = 0 this claim implies the lemma. We prove it by induction on i. 

For i = 1 this inequality directly follows from the optimality of the solution 
within [0, g] picked by Greedy{S,q,t). 

The set \ Wi (the sites in W that are eliminated during Si(q)) is the 

union of Wa U Wb (the case Wa H Wb yf 0 is possible) where: 

— Wa is the set of sites from W chosen by Greedy{S,q,t) during Si{q) 

— FFb is the set of sites in Wi-i that are visited by the i-th traveler during 
Sz{q)- 

By the definition, Wa C \ Wi_i and ]Wb\ < \Wi \ Wi_i| because of the 
optimality of the solution on Si{q). Therefore, 



|VF,_i\W,| <2|VFi\W',_i|. 
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Notice that by definition 
By induction assumption, 

m-i\>n-2\W,_,\. 



Hence, 

|1T,| = |Vb,_i| - iVb,.! \ Vb,_i| >n- 2|TT,_i| - 2|TT, \ >n- 2|iy,| 

and the claim follows. 



Lemma 2. For any £ > 0 and q — 2fc(^^), GreedyPath{S,q,t) and 
GreedyCycle{S,q,t) are {2 + e) -approximation algorithms for path- and cycle- 
TDO running in time 0((2fc^(^^))!^^|S'|^^ 

Proof. Suppose that an optimal solution to the path- (cycle-) TDO visits n 
sites. We may assume w.l.o.g. that n > t/k. The solution trivially yields \t/q]- 
traveler solution satisfying the requirements of Lemma H By Lemma Q the 
procedure Greedy{S, q,t) outputs at least | sites. Consequently, each of the 
procedures GreedyPath{S,q,t), GreedyCycle{S,q,t) outputs a tour visiting at 
least f > |(1 — sites. Hence, GreedyPath{S,q,t) and 

GreedyGycle{S, q,t)) are (2 -|- £) -approximation algorithms. 

By the remark on the time complexity of procedure Greedy {S., q, t), we obtain 
the complexity bound in the lemma thesis. 

Lemma|2|and the possibility of assuming t/k < n w.l.o.g immediately yield 
our main result. 

Theorem 1. For any e > 0, the path- and cycle- TDO for n sites admit (2-|-£)- 
approximation algorithms running in time 0((2fc^(^^))!^|-n^^ 

We can trivially model the classical path- and cycle-orienteering problems 
as special cases of TDO by setting the traveling time between two sites to the 
distance between them. 

Corollary 1. For any £ > 0, the classical path- and cycle-orienteering problems 
for n sites admit (2 -|- e)- approximation algorithms running in time 

where the distance between the furthest site pair is 
at most k times greater than that between the closest pair. 
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5 Rooted Case 

We can easily extend our technique to include the rooted case of the path- and 
cycle-TDO. Given a root s, we simply run the following modified version of 
Greedy{S, q, t). 

Procedure GreedyRoot{S, q, t, s) 

1. for j = 1,2,..., [t/g] do 

a) ^ S; 

b) for i = 1,2,..., \t/q'] do 

li i ^ j then let Ti be a maximum cardinality subset of T that can be 
visited within Si{q) by a path-tour; 

If * = j then let be a maximum cardinality subset of T containing s 
that can be visited within Si{q) by a path-tour; 

Visit Ti within Si{q)~, 

T3 ^ \ T}- 

2. Choose the set with the maximum number of sites, T TK 

By arguing analogously as in the unrooted case we can derive a rooted ver- 
sion of Lemma |21 The only essential difference is the time complexity which 
increases by the multiplicative factor of 0{t/q), i.e., 0{kn). Furthermore, the 
implementation of GreedyPath{S, q, t) and GreedyGycle{S, q, t) in the rooted 
case requires special care in order to avoid removing the root s if it happened to 
be visited in the k/2 vicinity of an Si(g)-interval border. For this purpose, it is 
sufficient to remove k sites before or after s, respectively. 

Summarizing, we obtain the following rooted version of Theorem D 

Theorem 2. For any e > 0, the rooted path- and cycle-TDO for n sites admit 
{2 -\-e)- approximation algorithms running in time 0((2fc^(^^))!^|-n^^ 



6 Final Remarks 

We have used a simple but powerful divide-and-conquer approximation tech- 
nique: ’’split” a problem in a number of ’’small” subproblems, then find exact 
solutions for each of the subproblems and ’’glue” these solutions to obtain an 
approximate solution of the problem. 

Our technique can be extended to include orienteering variants of many other 
optimizations problems (e.g., tree-orienteering) as well variants with parallel 
travelers. It can be also used in the design of efficient approximation algorithms 
for time-dependent bicriteria network optimization problems (see ini and the 
last chapter in j^j). Finally, it can be applied to derive approximative solutions 
to the Budget Prize Collecting Steiner Tree problem (see CH) and its time 
dependent variant. 
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Abstract. A mixed hypergraph is a triple {V, C, T>) where V is its vertex 
set and C and T> are families of subsets of V, C-edges and ©-edges. We 
demand in a proper colouring that each C-edge contains two vertices with 
the same colour and each ©-edge contains two vertices with different 
colours. A hypergraph is a hypertree if there exists a tree such that the 
edges of the hypergraph induce connected subgraphs of that tree. 

We prove that it is NP-complete to decide existence of a proper k- 
colouring even for mixed hypertrees with C = T> when k is given as part 
of input. We present a polynomial-time algorithm for colouring mixed 
hypertrees on trees of bounded degree with fixed number of colours. 



1 Introduction 

Hypergraphs are well established combinatorial objects, see |Q. A hypergraph is 
a pair (V,£’) where £" is a family of subsets of V of size at least 2; the members 
of V are called vertices and the members of £ are called edges. A hypergraph 
is a hypertree if there exists a tree with its vertex set equal to V such that the 
edges £ induce connected subgraphs of T. Mixed hypergraphs were introduced 
in [3 and mixed hypertrees were investigated in pj. 

A mixed hypergraph G is a triple (V, C, V) where C and V are families of sub- 
sets of V ; the members of C are called C-edges and the members of V are called 
©-edges. A mixed hypergraph is a mixed hypertree if (V,C U©) is a hypertree. 
A mixed hypergraph/tree is a mixed bihypergraph/tree if C=D. A proper k- 
colouring c of G is a mapping c :V —>{!,... ,A:} such that there are two vertices 
with ©ifferent colours in each ©-edge and there are two vertices with a Common 
colour in each C-edge. A proper colouring c is a strict k-colouring if it uses all k 
colours. The mixed hypergraph is colourable iff G has a proper colouring. Mixed 
hypergraphs can find their application in different areas: colouring block designs 
(see Pd), list-colouring of graphs and others. 

The feasible set T{G) of a mixed hypergraph G is the set of all k's such that 
there exists a strict fc-colouring of G. The (lower) chromatic number x(G) of G 

* Supported in part by GACR 201/1999/0242, GAUK 158/1999 and KONTAKT 
338/99. 

** Supported by Ministry of Education of Gzech Republic as project LN00A056 
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is the minimum number in ^(G) and the upper chromatic number x(G) of G is 
the maximum number in tF{G). The feasible set of G is gap-free (unbroken) iff 
J-{G) = [x(G), x(G)]; we use [a, b] for the set of all the integers between a and b 
(inclusively). If the feasible set of G contains a gap, we say it is broken. An ex- 
ample of a mixed hypergraph with a broken feasible set was firstly given in |^. 
On the other hand, it was proved in |2| that feasible sets of mixed hypertrees 
are gap-free. 

We address several complexity issues concerning mixed hypertrees in this 
paper. Since a feasible set of any mixed hypertree is gap-free and the chromatic 
number of a colourable mixed hypertree is at most two (it is one iff the mixed 
hypertree does not contain any 2?-edges), it is possible to restrict the complexity 
issues involving colouring mixed hypertrees to decision problems whether their 
upper chromatic numbers is at least a given number. 

The complexity problems concerning mixed hypertrees are addressed in Sec- 
tion 01 We strengthen the result of Flocos from ^ that it is NP-complete to 
determine the existence of a strict colouring with a given number of colours for 
mixed hypertrees to the case of mixed bihypertrees (Theorem EJ . We describe 
a polynomial-time algorithm for finding a strict fc-colouring (for bounded k) of 
mixed hypertrees with underlying trees of bounded degree (TheoremOl). This ex- 
tends the previous result of Voloshin (see jS|) for interval mixed hypergraphs, i.e. 
mixed hypertrees with underlying trees of degree at most two. Our polynomial- 
time algorithm is based on a new concept of /c-colourable witnesses (established 
in Section ED which is of its own theoretical interest. In particular, we get im- 
mediately as a collorary (Corollary P) that a feasible set of a mixed hypertree is 
gap-free giving a new proof completely different to the original one in P . 



2 Concept of Colourability Witnesses 

We introduce the concept of fc-colourability witnesses in this section. 

Let H = {V,C,T>) be a mixed hypertree with underlying tree T = (V,E). 
Let us consider T as a rooted tree and let level(u) be a level of its vertex v, i.e. 
the distance of v from the root. We call any sequence w\, . . . ,Wk of vertices V 
a witness sequence. We assume w.l.o.g. in the following that 0 = level(wi) < 
level(?C 2 ) < level(w 3 ) < . . . < level(?Cfe); this means in particular that wi is the 
root. 

Once we have a fc-colourability witness, we want to find some strict k- 
colouring c oi H (if it exists) such that the colours of the vertices w\,. . . ,Wk 
are mutually different. In order to do this, we extend the following simple proof 
from |0| of two-colourability of a colourable mixed hypertree H: We colour the 
root of T arbitrary and then we colour the vertices from root to the leaves — if 
a vertex form with its father a C-edge, we colour it with the colour of its father; 
if it does not form a C-edge with its father, we colour it with the colour different 
from the colour of its father. It is easy to check that this colouring is proper iff 
H is colourable. 
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In the following, we divide the vertices of H into three sets: L is the set of the 
vertices introducing new colours and their siblings, R is the set of the neighbours 
of the vertices of L and O is the set of other vertices of H. Later, we will colour 
the vertices of O as described in the two-colourability proof above. Next, we 
construct a mapping A : K — >■ {F, G, 1, . . . , A:} which assigns the number i to 
the vertex/ices introducing the colour z, to a vertex which should be coloured 
with the same colour as its father the value F and to a vertex which should be 
coloured with the colour different from the colour of its father the value G. Then, 
we describe a construction of a colouring c which is proper for some choice of 
A if there is a proper colouring of H assigning to w\, . . . ,Wk mutually different 
colours. 

Let us define the set of vertices L, R and O: 

L = {u|3z, father(v) = father(zcj) V v = Wi} 

R = {u|u ^ L A3u € L,uv € E} 

O = V\{LD R) 

We define a special set C{v) for each vertex as follows: 

C{v) = {F, G} U {z|father(v) = father(zz;j) V v = Wi} if v € L 

C{v) = {F, G} if u G i? 

C{v) = {F} if u G O and {u, father (w)} G C 

C{v) = {G} if u G O and {w, father(u)} ^ C 

We consider a mapping A : G — >■ {F, G, 1, . . . , k}] we demand that A(w) G £{v) 
for each vertex v. 

We define a colouring c of H based on a witness sequence wi, . . . ,Wk and a 
mapping A: 

— c{v) = 1 if ZU 2 is an successor of v 

— c(v) = A(v) if A(v) G {1, . . . , fc} 

— c(v) = c(father(u)) if A(u) = F 

— c(v) = c*(father(u)) if A(v) = G 

where c*(u) is following: 

— c*(u) = 2 if zz is the root (note that c(u) = 1 in this case) 

— c*(u) = c(father(zz)) if c(u) yf c(father(zz)) 

— c*(u) = c*(father(zz)) if c(u) = c(father(zz)) 

We introduce notion relating witness sequences to colourings of iJ: The se- 
quence wi, ... ,Wk of vertices K is a witness sequence of A-colourability with 
respect to a strict k' -colouring c : G — >■ k'} if c{wi) = i. A witness 

sequence is minimal if level(z(;i) is minimal. 

We say that the mapping A is consistent with a strict fc'-colouring c and a 
minimal fc-colourability witness sequence w\, . . . ,Wk (k < k') with respect to c 
if it satisfies: 
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— A(v) = c(v) if c(v) £ £(v) 

— A(v) = F if c(v) = c(father(u)) and F G £(v) 

(This holds in particular when {u, father (u)} is a C-edge.) 

— A(v) = G otherwise 

If father(w) does not exist, the second condition does not apply. 

The following theorem actually gives the sense to the previous definitions 
which introduced colourability witnesses: 

Theorem 1. Ifwi, . . . ,Wk is a minimal k- colourability witness sequence (where 
k is at least 2) with respect to a strict k' -colouring c (k < k' ) of a mixed hypertree 
H and if A is consistent with c, then c corresponding to wi, . . . ,Wk and X is a 
strict k -colouring. 

Proof. The colouring c clearly uses exactly k colours. It remains to prove that c 
is proper. Thus it is enough to prove that each edge of T> contains two vertices 
coloured by c with different colours and each edge of C contains two vertices 
coloured by c with the same colours. 

Let E be any edge of T> and let u and v be two vertices coloured by c with 
different colours. We can assume w.l.o.g. that u and v are neighbours and that 
e.g. u = father(u). Note that W 2 is not an successor of v, since otherwise it would 
hold that c{u) = c{v) = 1 due to the minimality of the witness sequence and 
our assumption that 0 = leve^ici) < level(w 2 ) < . ■ . < level(wfc). We claim that 
c{u) yf c{v). We distinguish several cases: 

- X{v) = F 

This case is impossible due to consistency of A with c. 

- X{v) = G 

It holds that c(u) = c*(father(w)) c(father(u)) — note that W 2 is not an 
successor of v since otherwise c{u) = c(v) = 1 due to the minimality of the 
witness sequence. 

- A(v) G fc} 

There is no predecessor of v coloured by c with the colour X(v) = c(v) due to 
the minimality of the witness sequence; it especially holds that c(u) yf c(v), 
in this case. 

We have just proved that all the P-edges contain two vertices coloured by dif- 
ferent colours. So, we focus our attention on the C-edges. 

Let E be any edge of C. If if contains two vertices whose successor is W 2 , then 
these two vertices of E are coloured by c with colour 1. We assume further that 
there is at most one vertex whose successor is W 2 ] clearly this vertex, if it exists, 
is the vertex of E which is the nearest one in E to the root. Let u and v be two 
nearest vertices of E coloured by c with the same colour. It is not necessary that 
c{u) = c{v). We first state a useful observation which is going to be used several 
times in the proof: 



Observation 1. If E contains a vertex u such that the both following conditions 
hold: 
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— \{u)e{F,G} 

— {u,father('u),father(father(u))} C E or {m, father(u)} = E 

Then E contains two vertices coloured by c with the same colour. 

The case that A(u) = F is trivial. If A(u) = G, then E cannot be {u, father(u)} 
due to the definition of A(u) and it holds that {u, father(u), father(father(u))} C 
E. Let u' = father(u) and u" = father(w') = father(father(u)). If c{u') = c{u") 
the claim is clear. If c(u') yf c{u"), then c{u) = c*{u') = c{u") and the claim also 
holds. 

We continue the proof of the theorem. We distinguish several cases to prove 
that E contains two vertices coloured by c with the same colour: 

— The vertex u is a predecessor of u (in particular level(u) > level(u)); 
the case that u is a predecessor of v is symmetric. 

In this case E contains all the vertices between u and v, i.e. it contains 
father('u) in particular. We distuinguish several cases: 

• \{u) = F 

Then u and father(u) are coloured with the same colour in this case. 

• A(u) = G 

If V is not a father of u, then E contains two vertices coloured by c with 
the same colour due to the observation. Let us consider the remaining 
case that v is the father of u (and thus c{u) yf c(u)). Due to consistency 
of A with c, it must hold that F ^ £{u) and thus u £ O. Then, E must 
contain at least three vertices, in particular it contains either a son of u, 
a brother of m or a grand-father of u: 

* E contains a son of u. Gall it w. 

Glearly w £ R\J O {ii w £ L, then u £ R and A(u) = F) and E 
contains two vertices coloured by c with the same colour due to the 
observation used for w. 

* E contains a brother of u. Gall it w. 

Glearly w £ R\J O {li w £ L, then also u £ L) and thus \{w) is 
either F or G. If \{w) = F, then c{w) = c(v) and these v and w are 
two vertices of E coloured by c with the same colour. If \{w) = G, 
then c{u) = c{w) = c*{v) and thus u and w are two vertices of E 
coloured by c with the same colour. 

* E contains a grand father of u. 

In this case E contains two vertices coloured by c with the same 
colour due to the observation. 

• A(u) G 

If it held that A(u) = i, then we could get by substituting Wi (a member 
of the witness sequence with leve^iCi) = level('u) > level(u)) with v a 
fc-colourability witness sequence with a smaller level sum, contradicting 
the minimality of the witness sequence. 

Neither u is a predecessor of u or u is a predecessor of u. 

Let w be the nearest common predecessor of u and v. We distinguish several 
cases to prove that E contains two vertices coloured by c with the same 
colour: 
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• \{u) = F or \{v) = F 

The appropriate vertex and its father are coloured by c with the same 
colour and thus E contains two vertices coloured by c with the same 
colour. 

• \{u) = G and A(u) = G 

If father(rt) w or father(u) w, then E contains two vertices of 
the same colour due to the observation; otherwise it holds that w = 
father(r() = father(u). Then, it holds that c{u) = c(v) = c*(w) and thus 
u and V are two vertices of E coloured by c with the same colour. 

• \{u) £ A:} and A(v) = G (the case that A(u) = G and A(v) £ 

{1, . . . , fc} is symmetric) 

If u and V are brothers, then due to the consistency of A with c, it has to 
hold that A(m) = A{v). Thus u and v are not brothers. If father(u) w, 
then E contains two vertices of the same colour due to the observation. 
Otherwise since u and v are not brothers, w = father(u) and w is the 
nearest common predecessor of u and v, the level of u is greater than 
the level of v, i.e. level(M) = level(wi) > level(u). But this means that we 
can get by substituting Wi with v (c(u) = c(wi) = c(v)) a fc-colourability 
witness sequence with a smaller level sum, contradicting the minimality 
of the witness sequence. 

• A(u) £ {1 , . . . ,k} and A(u) € {1, . . . , A:} 

If u and V are brothers, then due to the consistency of A with c, it holds 
that A(u) = A(v) and thus c(u) = c(v) and thus u and v are two vertices 
of E coloured by c with the same colour. 

If u and V are not brothers, then C{u) yf C{v). But due to the consistency 
of A with c and the fact that A(u) £ {1, . . . ,kj and A(v) S {I, . . . , A;}, it 
has to be c(u) y^ c(v). But this contradicts our choice of u and v, since 
we have chosen them to hold that c{u) = c(v). 

Corollary 1. If a hypertree H has a strict k' -colouring, then it has also a strict 
k-colouring for all 2 < k < k' . In particular, the feasible set of any mixed 
hypertree is gap-free. 

Proof. Let c be a strict A;'-colouring of H and let w\, . . . ,Wk be any minimal 
A:-colourability witness sequence of H with repsect to c. Let A be consistent with 
c. Then the colouring c corresponding to the witness sequence Wi, . . . ,Wk and 
mapping A is a strict A:-colouring. 

It is enough to apply this corollary for k' = x{H) to get that its feasible set 
is gap-free. 

3 Complexity Problems Dealing with Mixed Hypertrees 

We first state our negative result concerning the colouring of mixed hypertrees: 

Theorem 2. It is NP-complete to decide whether a given mixed bihypertree is 
strictly k-colourable for a given k. 
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Proof. The reduction from the decision problem on independence number of 
cubic graphs is presented: Let G be any cubic graph with n vertices wi, . . . , Wn- 
Let T be a tree with the vertex set equal to V = {*, Mi, . . . , Ui, . . . , u„} and 
the edge set equal to {-kUi, UiVi\l We define a mixed bihypertree H on 

the vertex set V as follows: The edges of H are precisely the sets {k,Ui,Vi, Uj} for 
all i and j such that WiWj is an edge of G (since iL is a mixed bihypertree, all its 
edges are both C and P-edges). We claim that x(iL) = n + a + 1 where a is the 
size of the independent set of G. This claim implies the stated NP-completeness 
of the problem. Note that each edge of the tree T is contained either in 6 or 12 
edges of H (the edges incident to the root are contained in 12, the other edges 
in 6 edges oi H). 

We prove the claim, now. Let Wjj , . . . , Wi^ be an independent set of G. We 
can construct the colouring c of the vertices of H using exactly n + a + 1 colours 
as follows: c{vj) = j, c{ui.) = n + j, c(*) = 0 and c{uj) = 0 for j yf ii, . . . ,ia- 
Let {k,Ui,Vi,Uj} be an edge of H. Either c{ui) = 0 or c{uj) = 0, since both Wi 
and Wj cannot be in the considered independent set and c(*) = 0 yf c{vi). This 
assures that the constructed colouring c is proper. 

On the other hand, let c be a colouring using n + a+1 colours. We construct 
an independent set of G of size a. Let Rq = k, . . . , Rn+a (we can assume w.l.o.g. 
that c(*) = 0) be the minimal (n + a + l)-colourability witness sequence with 
respect to T rooted at *. We first define a new strict colouring c using n + a + 1 
colours: 

— We set c(k) = 0. 

— If both Ui and Vi are among Ri, . . . , Rn+a (a ray of Type 1), we set c{ui) = 
c(ui) and c(vi) = c(u*). 

— If only Ui is among i?i, . . . ,Rn+a (a ray of Type 2), we set c{ui) = 0 and 
c(^'^) = c{ui). 

— If only Vi is among i?i, . . . ,R„+a (a ray of Type 3), we set c{ui) = 0 and 
c(^'^) = c{Vi). 

— If neither Ui nor Vi is among Ri, . . . , Rn+a (a ray of Type 4), we set c{ui) = 0 
and c(vi) to a completely new colour. 

Note that if c uses x(iL) colours, the last case cannot occur. Let {k,Ui,Vi,Uj} 
be an edge of H] c(k) ^ c(vi) due to the definition of c. If Ui and Uj belong both 
to rays of Type 1, then the original colouring c cannot be proper. Thus at least 
one of them belongs to a ray of Type 2, 3 or 4 and is coloured by 0, the same 
colour as the vertex *. We have just proven that c is a proper colouring. Let 
A = {wi\c{ui) yf c(*)}. The set A is an independent set of G, since c is a proper 
colouring, and its size is exactly a, since vertices *, ui, . . . are coloured by 
mutually different n + 1 colours. This finishes the proof of the claim. 

We describe the promised algorithm for colouring mixed hypertrees in the 
next theorem: 

Theorem 3. If k is hounded and the maximum degree of tree T is hounded, it 
can he decided in polynomial time whether a given mixed hypertree H on T is 
strict k-colourahle. 
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Proof. Consider the following algorithm: 

for all sequences w\, . . . ,Wk of vertices of H such 

that 0 = level(w;i) < level(t<; 2 ) < ■ • ■ < level(wfe) 
do 

construct sets L, R and O for . . . ,Wk 
construct sets C{v) for all vertices v 

for all mappings X : V ^ {F, G, 1, . . . , fc} such that \{v) £ £{v) 
do 

construct c for wi, . . . ,Wk and A 
if c is a proper colouring then 
output YES 
halt 

done 

done 

output NO 
halt 



It is clear that if the algorithm outputs YES, then H has a strict fc-colouring. 
Suppose that the algorithm outputs NO and H has a strict fc-colouring — call 
this colouring c. But there certainly exists a minimal witness sequence w\, . . . ,Wk 
corresponding to c and the mapping A consistent with c. But in the loop when 
the algorithm considered this witness sequence w\, . . . ^Wk and this mapping A, 
the constructed colouring c would be strict and thus the algorithm would output 
YES. But since this did not happen, the strict fc-colouring coi H cannot exist. We 
have just proven that the algorithm outputs YES iff H has a strict fc-colouring. 

We give an estimate for running time of the algorithm now. Let d be a 
maximal degree of T. There are choices of the witness sequence in the 

first step. The sets L, R, O and C{v) can be constructed in polynomial time 
ti(n, k) given wi, . . . ,Wk where n is the number of vertices of H . We write m 
for the number of edges of H in the next. It is clear that \L\ < k{d — 1) < kd 
and \R\ < k{d — 1)^ + k < kdf . There are exactly |£(r')| < k + 2 choices of 
X(v) for each v € L, two choices of X{v) for each v £ R and one choice of 
X{v) for each v £ O. Thus it holds that IIy^L\C{v)\ < {k + 2)^^^ and 21-^1 < 
2 kd ^ colouring c can be constructed and it can be checked whether c is 
proper in polynomial time t 2 (n, m, k). The running time of the whole algorithm 
is k) {k+2)^'^2^'^'^ t 2 {n, m, k)) — this can be clearly majorized 

by a polynomial in n and m if k and d are bounded. 



Open Problems 

The most interesting still open problem is the decision problem of the existence 
of a strict ^-colouring of a given mixed hypertree for fixed fc: Is it NP-complete 
to decide whether there exists a strict ^-colouring of a given mixed hypertree 
even for fixed fc? There are several other open algorithmic problems concerning 
colouring mixed hypertrees raised in j^. 
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Combining Arithmetic and Geometric Rounding 
Techniques for Knapsack Problems* 
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Abstract. We address the classical knapsack problem and a variant in 
which an upper bound is imposed on the number of items that can be 
selected. We show that appropriate combinations of rounding techniques 
yield novel and powerful ways of rounding. As an application of these 
techniques, we present a faster polynomial time approximation scheme 
requiring only linear storage, that computes an approximate solution of 
any fixed accuracy in linear time. This linear complexity bound gives 
a substantial improvement of the best previously known polynomial 
bound | 2 |. 



1 Introduction 

In the classical Knapsack Problem (KP) we have a set N := {1, . . . ,n} of items 
and a knapsack of limited capacity. To each item we associate a positive profit 
Pj and a positive weight Wj. The problem calls for selecting the set of items 
with maximum overall profit among those whose overall weight does not exceed 
the knapsack capacity c > 0. KP has the following Integer Linear Programming 
(ILP) formulation: 



maximize E PjXj (1.1) 

j€N 

subject to '^j^j ^ c (1-2) 

j€N 

a;, G{0,1}, jGN, (1.3) 

where each binary variable Xj, j G N, is equal to 1 if and only if item j is 
selected. In general, we cannot take all items because the total occupancy of the 
chosen items cannot exceed the knapsack capacity c. In the sequel, without loss 
of generality, we assume that ^ ^ ^ every j € N. 

The k-item Knapsack Problem (kKP), is a KP in which an upper bound of 
k is imposed on the number of items that can be selected in a solution. The 

* Supported by the “Metaheuristics Network”, grant HPRN-CT-1999-00106, and by 
Swiss National Science Foundation project 21-55778.98, “Resource Allocation and 
Scheduling in Flexible Manufacturing Systems”. 
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problem can be formulated as (Ej-ca) with the additional constraint 

jGN 



with 1 < k < n. 

KP has widely been studied in the literature, see the book of Martello and 
Toth |B| for a comprehensive illustration of the problem. kKP is the subprob- 
lem to be solved when instances of the Cutting Stock Problem with cardinality 
constraints are tackled by column generation techniques. kKP also appears in 
processor scheduling problems on computers with k processors and shared mem- 
ory. Furthermore, kKP could replace KP in the separation of cover inequalities, 
as outlined in | 2 |. 

Throughout our paper let OPT denote the optimal solution value to the 
given instance and w{F) = P(^) — where F C N. 

An algorithm A with solution value is called an e- approximation algorithm, 
e € (0, 1), if > (1 — e)OPT holds for all problem instances. We will also call 
£ the performance ratio of A. 

Known Results. It is well known that KP is NP-hard but pseudopolynomially 
solvable through dynamic programming, and the same properties hold for kKP 

0. Basically, the developed approximation approaches for KP and kKP can be 
divided into three groups: 

1. Approximation algorithms. For KP the classical | -approximation algorithm 
(see e.g. j7j) needs only 0{n) running time. A performance ratio of ^ can be 
obtained also for kKP by rounding the solution of the linear programming 
relaxation of the problem (see | 2 |); this algorithm can be implemented to run 
in linear time when the LP relaxation of kKP is solved by using the method 
by Megiddo and Tamir jOj. 

2. Polynomial time approximation schemes (PTAS) reach any given perfor- 

mance ratio and have a running time polynomial in the length of the encoded 
input. The best schemes currently known requiring linear space are given in 
Caprara et al. |2|: they yield a performance ratio of e within 0 ( 71 ^ 1 /^! - 2 ) 
and running time, for KP and kKP respectively. 

3. Fully polynomial time approximation schemes (FPTAS) also reach any given 
performance ratio and have a running time polynomial in the length of the 
encoded input and in the reciprocal of the performance ratio. This improve- 
ment compared to 2 . is usually paid for by much larger space requirements, 
which increases rapidly with the accuracy e. The first FPTAS for KP was 
proposed by Ibarra and Kim 0, later on improved by Lawler 0 and Kellerer 
and Pferschy 0 . In Caprara et al. |2| it is shown that kKP admits an FPTAS. 

New Results. Rounding the input is a widely used technique to obtain poly- 
nomial time approximation schemes 0. Among the developed rounding tech- 
niques, arithmetic or geometric rounding are the most successfully and broadly 
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used ways of rounding to obtain a simpler instance that may be solved in poly- 
nomial time (see Sections rn\ and O for an application of these techniques to 
kKP). Our paper contains a new technical idea. We show that appropriate com- 
binations of arithmetic and geometric rounding techniques yields to novel and 
powerful rounding methods. To the best of our knowledge, these techniques have 
never been combined together. By using the described rounding technique, we 
present an improved PTAS for kKP requiring linear space and running time 
0(n-|- k ■ Our algorithm is clearly superior to the one in j2j, and it 

is worth noting that the running time contains no exponent on n dependent on 
e. Since KP is a special case of kKP, we also improve the previous result for KP. 

2 Rounding Techniques for kKP 

The aim of this section is to transform any input into one with a smaller size 
and a simpler structure without dramatically decreasing the objective value. The 
main idea is to turn a difficult instance into a more primitive instance that is 
easier to tackle. This will help for efficient enumeration. 

In the following we discuss some general techniques that are applied through- 
out our paper. We discuss several transformations of the input problem. Some 
transformations may potentially increase the objective function value by a factor 
of 1 — 0{e), so we can perform a constant number of them while still staying 
within 1 — 0(e) of the original optimum. Others are transformations which do not 
increase the objective function value. When we describe the first type of trans- 
formation, we shall say it produces 1 — 0{e) loss, while the second produces no 
loss. 

Let denote the solution value obtained in 0(n) time by employing the 
1 /2-approximation algorithm for kKP described in j^. In Pj, it is shown 

that 






(2.5) 



where Pmax = max^- pj . 

Throughout this section we restrict our considerations to feasible solutions 
with at most 7 items, where 7 is any positive integer not greater than k. The first 
observation is that at most an e-fraction of the optimal profit OPT is lost by 
discarding all items j where pj < eP^/y, since at most 7 items can be selected 
and P^ < OPT. From now on, consider the reduced set of items with profit 
values greater than eP^/ 7 . 

In order to reduce further the set of items, the useful insight is that when 
profits are identical we pick them in order of their weights. Since the optimal 
profit is < 2P^ , among all items with profit value p G {pi, ...,p„}, we can keep 



the first n = min 




items with the smallest weights, and discard the 



others with no loss. Of course, we cannot hope to obtain a smaller instance if 
all profits are different. In the following, we show how the number of different 
profits can be reduced by rounding the original profits. We revise two rounding 
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techniques and show that an appropriate combination of both leads to a better 
result. We will call a profit value p large if p > and small otherwise. 



2.1 Arithmetic Rounding 

A sequence 01 , 02 ,... is called an arithmetic sequence if, and only if, there is a 
constant d such that Oi = oi +<i- (t — 1), for all integers z > 1. Let us consider the 
arithmetic sequence S'a( 7 ) obtained by setting oi = d = eP^ /^. We transform 
the given instance into a more structured by rounding each profit down to the 
nearest value among those of Sa{j)- Since in the rounded instance the profit of 
each item is decreased by at most eP^/ 7 , and at most 7 items can be selected, 
the solution value of the transformed instance potentially decreases by eP^ . Of 
course, by restoring the original profits we cannot decrease the objective function 
value, and therefore, with 1 — e loss, we can assume that every possible profit is 
equal to oi = for some z > 1. Furthermore, since Pmax = maxj^jsfPj < P^, 

the number of different profits is now bounded by < [fj- 

The largest number tz, of items with profit Oj, for z=l,..., [^J, that can be 
involved in any feasible solution is bounded by 



TZj < min{ 7 . 



OPT 




} < min{ 7 , 



27 

si 






and we can keep the first zz^ items with the smallest weights, and discard the 
others with no loss. Now, the number of items with profit ai is at most 7 , 
if Qi is a small profit (i.e. when i = 1 ,..., [|J), and at most otherwise 
(z = [|J + 1 , ..., [gj)- Thus, by applying the described arithmetic rounding we 



I ~ I 

have at most [ 2 /eJ 7 items with small profits and Le?J large 

profits. Recall that when a summation can be expressed as where 

/(fc) is a monotonically decreasing function, we can approximate it by integral 
(see, e.g. |2| p. 50): Y^k=xfW — /(^)^^- Furthermore, we are assuming 

that 0 < e < 1, and recall that ln(l + a:) > a;/(l + x), for cc > — 1. Therefore, the 
total number of items in the transformed instance is bounded by 



2 

e 



LiJ 

7+ ^ 



27 

si 




1 

i 



<?2(l+/’' ^)= ?I(i + ln7-ln(2-£)) 
e 7|-i * e 

< — (l + ln7- ^ < — (l + ln7) 

£ 2 — 6 € 

= 0 ( — In 7 ). 

£ 



We see that by applying the described arithmetic rounding we have at most 2j/e 
items with small profits and ^ Iny with large profits. 
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2.2 Geometric Rounding 

A sequence 01,02,... is called a geometric sequence if, and only if, there is a 
constant r such that Oj = oi • for all integers z > 1 . Let us consider 

the geometric sequence S'g(7) obtained by setting oi = eP^/7 and r = 

We round each profit down to the nearest value among those of Sgi^j). Since 
Oi = (1 — £)Oi_|_i, for z > 1, each item profit is at most decreased by a factor of 
1— £, and consequently, the solution value of the transformed instance potentially 
decreases by the same factor of 1 — e. Therefore, with 1—e loss, we can assume that 
every possible profit is equal to o^ = for some z > 1 . Furthermore, 

since Pmax < j the number of different profits is bounded by the biggest 
integer (3 such that 

^pH 1 

— . (— < pff. 

7 

Since In(Y^) > £, we have /3 — 1 < < yln^. In any feasible solution, 

the largest number Ui of items with profit Oj, for z = is bounded by 



. OPT 27 

rz*<min{7, ^ | < mm{7, — (1-e) |, 

_ 7 ^ 1 — £■ ' - ^ 

and we can keep the first nt items with the smallest weights, and discard the 
others with no loss. Let a = + 1 . Again, the number of items with 

profit Qi is at most 7, if is a small profit (i.e. when z = 1, ...,a), and at most 
[—(1 — otherwise (z = o;+ 1 , ...,/ 3 ). Therefore, the total number of items 

in the transformed instance is bounded by 




< (- ln( 2 /£) + 1)7 + ^ = 0 (^ In -). 

£ £ £ £ 



We see that by applying the geometric rounding we have at most j/e items with 
large profit, while O(^lnl) items with small profits. Contrary to arithmetic 
rounding, the set of items that has been reduced most is the set with large 
profits. This suggests us to combine the described rounding techniques. 

2.3 Arithmetic & Geometric Rounding 

We use arithmetic rounding for the set of items with small profits and geomet- 
ric rounding for large items. More formally, let us consider the Arithmetic & 
Geometric sequence Sag{j) = (01,02, ...) defined by setting 

_ eP^ ( i for z = 1, ..., [2/£j , 

~ j ((137)“''’*“^^^®^“^ otherwise. 
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We round each profit down to the nearest value among those of Sag{j)- Now, 
consider every set Ni of items with the same rounded profit value Ui, and take 
the first 



_J7 for i = 1, [2/eJ , 

* \ [^(1 — £)“■*"*“ L2/eJ-iJ otherwise, 

items with the smallest weights. Selecting the first Ui items with the smallest 
weights can be done in 0(|7Vi|) time. That is, 0(|A^i|) time is sufficient to select 
the rii-th item with the smallest weight (see (Q) and only 0(|A^i|) comparisons are 
needed to extract the — 1 items with smaller weight. Therefore the amortized 
time is linear. 

By using the arithmetic rounding technique for small items, we have at most 
27/e small items with 1 — e loss (see Section E~Tll . While, by using the geometric 
rounding technique described in Section 12.21 for large items, we have at most 
7/e large items with 1 — e loss. The resulting transformed instance has at most 
37/e items with 1 — 2e loss. Furthermore, let tp = P — a + [2/eJ + 1. Observe 
that the i/'-th element of Sagij) is not smaller than , i.e., . Consider 

any subset S Q N of items with at most 7 items, and let Xi denote the total 
number of items from S with profit in interval [oi,ai+i), i = 1,2, ...ip. Let us 
call vector {xi,X2, ...,a;,^) an S -configuration. It is easy to see that by using the 
reduced set of items it is always possible to compute a solution having the same 
^'-configuration, for any S Q N with 7 items. Summarizing: 

Lemma 1. For any positive integer j < k, it is possible to eompute in linear 
time a reduced set C N of items with at most 37/e items, such that, for any 
subset S N with at most 7 items, there exists a subset S.y C such that S.y 
is the subset of N having the same eonfiguration as S and with the least weights. 



Corollary 1. For any subset S Q N with at most 7 items, there exists a subset 
S.y C N.y with w{Sj) < w{S), = liS”! and p{S.y) > p{S) — 2e • OPT. 

3 A Faster Linear-Storage PTAS for kKP 

Our PTAS for kKP improves the scheme of Caprara et al. 0, and in fact it 
strongly builds on their ideas. However, there are several differences where a 
major one is the use of two reduced sets of items instead of the entire set JV: let 
£ := min{|"l/e] —2,k}, where e < 1/2 is an arbitrary small rational number; our 
algorithm uses sets At/, and Af computed by using the Arithmetic & Geometric 
rounding technique (see Lemma^) when 7 := fc and 7 := £, respectively. We will 
show that sets At/, and At£ help for efficient enumeration. 

For any given instance of kKP, the approximation scheme performs the 
following five steps (S-l)-(S-5). 

(S-1) Initialize the solution A to be the empty set and set the corresponding 
value P^ to 0. 
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(S-2) Compute the reduced sets Nk and N^. 

(S-3) Consider each L C Ng such that \L\ < i — 1. If w{L) < c and p{L) > 
let A := L, := p{A). 

(S-4) Consider each L C N{, such that \L\ = i. If w(V) < c, consider sequence 
Sag{^) = (aij 02 ,...) and let h be an integer such that ah < min^gip^- < 
tth+i- Apply algorithm Hi to the subinstance S defined by item set {i G 
Nk\L : Pi < Uh+i}, by capacity c — w{L) and by cardinality upper bound 
k—i. Let T and P^ (S) denote the solution and the solution value returned 

hy Hi when applied to S. If p{L) + P^{S) > P^ let A := L U T and 
pA — 

(S-5) Return solution A of value P^. 

Observe that in steps (S-3) and (S-4), subsets L are computed by considering 
just the items from Ni. On the other hand, in step (S-4), we remark that the 
subinstances S are defined by using items from Nk- 

3.1 Analysis of the Algorithm 

Step (S-2) can be performed in 0(n) time by Lemma E In step (S-3) the al- 
gorithm considers 0(|A^|^“^) subsets, for each one performing operations that 
clearly require 0{£) time. In step (S-4) the algorithm considers 0(|A^|^) sub- 
sets L. For each L the definition of subinstance S requires 0{\Nk\ ■ tj time. 
Algorithm H^ applied to subinstance S runs in OdS”!) = 0(|Afe|) time 0-By 
Lemmad \Nk\ = 0{k/s) and |Af| = 0{£/e). Therefore, step (S-4) is performed 
in 0(|Ad^ • |A^fc| • £) = 0{k ■ (f)^+^) = k ■ (l/e)°(i/^). It follows that the overall 
running time of the algorithm is 0{n + k ■ (l/e)*^*^^/®) and it is not difficult to 
check that steps (S-l)-(S-5) require linear space. 

What remains to be shown is that steps (S-l)-(S-5) return a (1 — 0{e))~ 
approximate solution. 

If an optimal solution contains at most i items, the solution returned is 
(1 — 2e)-approximate solution. Indeed, by Corollary Q] the reduction of N to Ng 
results in at most (1 — 2e) loss of profit, and steps (S-3)-(S-4) consider all subsets 
of having at most I items. 

Otherwise, let {ji, ..., ...} be the set of items in an optimal solution ordered 

so that pj^ > ... > Pje > ..., and let L* = {ji, be the subset obtained by 

picking the first i items with the largest profit. Consider subinstance S* defined 
by item set {i G N\L* : pi < min^-gj^* by capacity c — w{L*) and by 
cardinality upper bound k — £. Clearly, 

p(L*) + OPTs-^ = OPT, (3.6) 

where OPTs* denotes the optimal value of instance S*. Now, consider the 
reduced set Nk and subinstance 5^ defined by item set {i G Nk\L* : pi < 
min^gi* pj}, by capacity c — w{L*) and by cardinality upper bound k — £. By 
Corollary P we have 

OPTsi > OPTs> - 2eOPT, 
where OPTs* denotes the optimal value of instance S* . 



(3.7) 
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Let us use L to denote the set of items having the same configuration as L* 
and the least weights. By Lemma in one of the iterations of step (S- 4 ), set 
L is considered. In the remainder, let us focus on this set L, and consider the 
corresponding subinstance S defined in step (S- 4 ). By Corollary ^ we have 

p{L) > p{L*) - 2 eOPT. ( 3 . 8 ) 

We need to show that the optimal solution value OPTs of instance S cannot be 
smaller than OPTs* ■ 

Lemma 2 . OPTs > OPTs*. 

Proof. Recall that the subinstance S is defined by item set Ig = {* G Nj^\L : 
Pi < Ofi+i}, where at+i is a term of sequence Sag{i) = ( 01 , 02 ,...) such that 
O'h < rniUj-giPj < Uh+i (see step (S- 4 )). On the other hand, the subinstance 
is defined by item set Is* = {i € Nk\L* : pi < min^g^. pj}. Since we are 
assuming that L has the same configuration as L* , there are no items from L* 
with profit in intervals [oj,Oi+i), for i < h. Therefore, we have min^g^. > Oh 
and {i G Nk\L : pi < an} = {i € Nk\L* : pi < an] = {i & Nk '. Pi < an]. 
Furthermore, since there is an item from L with profit in interval [o;,, o/j+i), and 
since L* has the same configuration as L, there exists an item from L* with 
profit pj < Uh+i and, therefore, minj^s* Pj < ah+i. It follows that Is* C G 
Nk\L* : Pi < au+i}. 

By the previous arguments, the items of S^, except for those belonging to 
Ah = {i & Nk r\ L : Oh < Pi < o^+i}, are also items of S, i.e.. 

Is* Q Is O Ah. 

If there exists an optimal solution for S^. such that no one of the items from 
Ah is selected, then OPTs > OPTs^, since the knapsack capacity of is not 
greater than the one of S, i.e. c — w{L) > c — w{L*) (recall that L is the subset 
having the same configuration as L* with the least weights). 

Otherwise, let Gi be the subset of items from Ah in an optimal solution for 
Sh, and let g := |Gi|. Let G2 be any subset of set {i G L*\L ■ Oh < Pi < ah+i} 
of exactly g items. It is easy to see that G2 exists. Furthermore, since G2 C L* 
and Gi C Is* , we have 



min Pi > maxn,-. ( 3 . 9 ) 

jeG2 jGGi 

Observe that w{L*) — w(L) > w{G2) —w{G\). Therefore, the knapsack capacity 
c — w{L) of S cannot be smaller than c — w{L*) + w{G2) — w{Gi). The solution 
Gi 2 obtained from the optimal solution for by replacing the items from Gi 
with those from G2, requires a knapsack of capacity bounded by c — w{L*) + 
w{G 2) — w{Gi). Therefore, G12 is a feasible solution for S since the capacity of 
S is greater than the capacity of S^. by at least w{G2) — w{Gi). Finally, from 
inequality 113. till , the solution value of G12 is not smaller than OPTs* and the 
claim follows. 
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Let P^{S) denote the solution value returned hy when applied to S. 
Then we have the following 

Lemma 3. p{L) + P^{S) > (1 — Ae)OPT . 

Proof. Observe that by Lemma El and inequality lid.Yil . we have 

OPTs > OPTs- - 2eOPT. (3.10) 

We distinguish between two cases. 

1. If p{L*) > (1 — e)OPT then by inequalities (tLtill . (13. 8t and (13.101 . we have 

p{L) + P^{S) > p{L*) - 2eOPT + \oPTs 

> ^(2p(L*) + OPTs.) - 3e • OPT 

> ^(p(T*) + OPT) - 3£ • OPT 

> (1 - Ej2)OPT - 3e • OPT 

> (l-7/2s)- OPT. 

2. If p(T*) < (1 — e)OPT then the smallest item profit in L*, and hence every 
item profit in S*, is smaller than OPT. The biggest profit of S is at 
most ^^f^^ OPT + Indeed, since ah < ^^f^^ OPT < 2(1 — e)^ < 
( [|J — 1)^^, it turns out that h < [f J — 1, and by definition of Sag{i), we 
have that Qh+i = o/j + Therefore, for each item j belonging to S, profit 
Pj is bounded by 

OPT 

n<—- 

Since OPTs — P^{S) < ma,Xj^sPj (see inequality dSJ), we have 
OPT 

p{L) + P^{S) + — > p{L) + OPTs 

> p{L*) + OPTs- - 4e ■ OPT = (1 - 4e)OPT. 

By the previous lemma, steps (S-l)-(S-5) return a solution that cannot be 
worse than (1 — 4e)OPT. Thus, we have proved the following 

Theorem 1. There is a PTAS for the k-item knapsack problem requiring linear 
space and 0{n + k ■ (l/e)'^^^/^^) time. 

To compare our algorithm with the one provided in 0 notice that the running 
time complexity of the latter is whereas our scheme is linear. As in 

E], our algorithm can be easily modified to deal with the Exact k-item Knapsack 
Problem, that is a kKP in which the number of items in a feasible solution must 
be exactly equal to k. The time and space complexity and analysis of the resulting 
algorithm are essentially the same. 
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The Complexity of Maximum Matroid-Greedoid 

Intersection* 



Taneli Mielikainen and Esko Ukkonen 
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FIN-00014 University of Helsinki, Finland 



Abstract. The maximum intersection problem for a matroid and a gree- 
doid, given by polynomial-time oracles, is shown AP-hard by expressing 
the satisfiability of boolean formulas in 3-conjunctive normal form as 
such an intersection. Also the corresponding approximation problem is 
shown AP-hard for certain approximation performance bounds. This 
is in contrast with the maximum matroid-matroid intersection which is 
solvable in polynomial time by an old result of Edmonds. 



1 Introduction 

A set system (S', F) where S is a finite set (the domain of the system) and F is 
a collection of subsets of S is a matroid if 

(Ml) 0 G F; 

(M2) If r C A G F then A G F; 

(M3) If A, A G F and |A| > |A| then there is an a; G A \ A such that 
AU{a;} G F. 

A greedoid is a set system (S, F) that satisfies (MI) and (M3). 

In applications a matroid or a greedoid is given by an oracle, i.e., by a deter- 
ministic algorithm that answers the question whether A belongs to F for any 
ACS. 

Many combinatorial problems can be formulated using matroids or greedoids 
(see e.g. 0E1). The seminal example is the maximum matching problem in bipar- 
tite graphs. Each instance of the problem can be represented as the intersection 
of two matroids. The maximum matching corresponds to the largest set in the 
intersection. 

To be able to consider in the matroid-greedoid framework general combi- 
natorial problems which have infinitely many instances we introduce families of 
matroids and greedoids that have uniform polynomial-time representation. More 
formally, let T — {{Sh,Fh)hsH} be a possibly infinite set of matroids or gree- 
doids. Then T is said to be given by a uniform polynomial-time oracle if there 
is an algorithm O, that when given h and some X Q answers whether or not 
A G F/j. The algorithm O runs in polynomial time in |/i| and |A|. 

* A work supported by the Academy of Finland. 
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Let T = {{Sh,Fh)h€H} and G = {{Sh,Gh)h(^H} be two such families given 
by uniform polynomial-time oracles. Note that the index set H is the same for 
both and for given h, both have the same domain Sh- 

The maximum intersection problem for T and Q is to find, given an index 
h G H, a. set X G Ffi r\ Gfi such that |X| is maximum. 

Edmonds 0 gave the first polynomial-time solution for the intersection prob- 
lem in the case that both T and Q are families of matroids. In this paper we 
consider the obvious next step, namely the intersection of families of matroids 
and greedoids. 

In Section El we show, by reduction from the 3SAT, that the maximum in- 
tersection problem for a matroid family and a greedoid family, given by uniform 
polynomial-time oracles, is NP-hard. In Section 01 we modify the above con- 
struction to show that the maximum matroid-greedoid intersection is not ap- 
proximable within a factor 15/11^“*^ for any fixed e > 0 and its weighted version, 
the maximum weight matroid-greedoid intersection, is not approximable within 
2l'S'>l^ for any fixed fc > 0, unless P = NP. 

2 AfP-Hardness 

Recall that the N P-complete problem 3-satisfiability (3SAT) is, given a boolean 
formula h in 3-conjunctive normal form (3CNF), to decide whether or not there 
is a truth assignment Z for the variables of h such that h{Z) = true. 

We construct the instance {Sh, Fh), {Sh, Gh) of matroid-greedoid intersection 
that corresponds to h as follows. Let h contain n different boolean variables. Then 
Sh contains symbols ti, /i, . . . , /„. The symbols ti, /i, . . . , /„ will be used 

to encode truth assignments: fi encodes that the Ah variable is true and fi that 
it is false. 

The subset collection Fh consists of all subsets of Sh that contain at most 
one of symbols ti, fi for z = 1, . . . , n. It is immediate, that (Sh,Fh) satisfies the 
matroid properties (MI), (M2) and (M3). 

The subset collection Gh consists of two groups. The first group A consists 
of all subsets X of Sh such that \X\<n and A fl {t„, /„} = 0. The second group 
B consists of the sets that represent a truth assignment that satisfies h. Such a 
set is of size n and contains one element from each U., fi. 

To verify that {Sh,Gh) is a greedoid, first note that (Ml) is obviously true. 
To verify (M3), let X,Y G Gh such that \X\ > \Y\. 

1. If |A| < n then X and Y must belong to group A. Hence for any element 

a: G A \ y, set E U {x} belongs to group A and hence to Gh. 

2. If |A| = n and |A \ F| = 1 then F U (A \ A) = A, i.e., property (M3) holds. 

3. In the remaining case |A| = n and |A\F| > 1. As A\F contains at least 
two elements and no set of Gh contains both and fn, at least one element 
X G X\Y must be different from tn, fn- Then Y U {x} belongs to group A. 

The matroid-greedoid intersection FhldGh contains a set A such that |A| = n 
if and only if the group B in the definition of Gh is non-empty, that is, if and 
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only if h is satisfiable. As such a set X is also the largest in Fh IT Gh, we have 
shown: 

Lemma 1. Boolean formula h is satisfiable if and only if the maximum element 
in Fh n Gh for matroid {Sh, Fh) and greedoid {Sh, Gh) is of size n where n is the 
number of variables of h. 

The above construction yields a matroid family T = {{Sh, Fh)he 3 Ci^F} and 
a greedoid family Q — {{Sh, Gh)h&3CF!F}- Both have a uniform polynomial-time 
oracle for checking membership in Fh and Gh'- The only nontrivial task of the 
oracle is to verify when a truth assignment satisfies a given formula h, but this 
is doable in polynomial time in \h\ using well-known techniques. As \h\ — 0{n^) 
for a 3CNF formula h and |5'?t| = 2n, the running time of the oracle is in fact 
polynomial in | S'/ll, as required. 

It follows from Lemmanthat our construction is a polynomial-time reduction 
of 3SAT to the maximum matroid-greedoid intersection problem. Therefore we 
have the following. 

Theorem 1. The maximum interseetion problem for a matroid family and a 
greedoid family that are given by uniform polynomial-time oraeles is NP-hard. 

Also the maximum weight matroid-greedoid intersection is A^P-hard since 
maximum matroid-greedoid intersection is its special case. In this problem one 
should find, given integer weights w{x) for x G Sh, a set A G T Gh such that 
w{x) is maximum. 



3 Inapproximability 

As the maximum matroid-greedoid intersection problem is a maximization prob- 
lem whose exact solution turned out to be AP-hard, it is of interest to see 
whether or not an approximation algorithm with a performance guarantee is 
possible. An approximation algorithm would find an element in the intersection 
of the matroid and the greedoid which is not necessarily the largest one. 

Following the standard approach (see e.g. m), we say that maximization 
problem is polynomial-time approximable within r where r is a function from N 
to N if there is a polynomial-time algorithm that finds for each instance x of the 
problem a feasible solution with value c{x) such that 



^Max (^) 

c{x) 



< r(|a;|) 



where CMax{x) is the largest possible value (the optimal value) of a feasible 
solution of X. The performance ratio of such an approximation algorithm is 
bounded by the performance guarantee r. 

Theorem 2. The maximum interseetion problem for a matroid family and a 
greedoid family with domains {Sh ■ h G H}, given by uniform polynomial-time 
oraeles, is not polynomial-time approximable within for any fixed e > 0, 

unless P = AP. 
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Proof. Assume that for some e > 0, the maximum matroid-greedoid intersection 
is polynomial-time approximable within We show that then we can solve 

3SAT in polynomial time. 

As in the proof of Lemma ^ let h again be a boolean formula with n variables 
in 3-conjunctive normal form. Now set Sh contains in addition to the truth as- 
signment symbols ti, /i, • . • , /n also some indicator elements Pi, 1 < i < /(e). 
Here the number of indicators, /(e), depends on e as will be shown below. The 
indicators are needed for padding the elements of the matroid and the greedoid 
such that the maximum intersection becomes for a satisfiable h sufficiently larger 
than for a non-satisfiable h. 

We again construct a matroid {Sh,Fh) and {Sh,Gh) as follows. 

The subset collection Fh contains all subsets of St that do not contain both 
ti and fi for any 1 < i < n. It is again clear, that {Sh,Fh) satisfies properties 
(Ml) and (M2). As regards (M3), let X,Y G F^ such that |A| > |y|. If there 
is some indicator a; in A \ F, then Y U {x} G Fh- Otherwise X must contain 
more truth assignment symbols that Y . Then there must be index i such that 
either ti or fi, call it x, belongs to X but neither of ti and fi belongs to Y. Then 
Y U {a;} G Fh- Thus {Sh, Fh) is a matroid. 

The subset collection Gh consists of three groups. Group A and B are exactly 
same as in the construction of Lemma H Hence the sets in groups A and B do 
not contain any indicator elements. Group G consists of the sets of size n in 
groups A and B, padded with indicators in all possible ways. That is, if X G A 
or X G B such that |A| = n and Q is a non-empty subset of {p\, . . . ,p/(e)}, then 
X UQ belongs to group G. 

To verify that (Sh,Gh) is a greedoid, property (Ml) clearly holds. To verify 
(M3), let X,Y G Gh,\X\ > |F| and consider the following cases. 

1. If |F| < n then there is a truth assignment symbol x G X \ Y such that 

Y U {x} belongs to group A or to group B as shown in the proof of Lemma 

□ 

2. If |F| > n then there is there is an indicator x G X \ Y and thus Y U {x} 

belongs to group G. 

By our construction, the boolean formula h is satisfiable if and only if the 
largest element in Fh H Gh is of size \Sh\ — n = /(e) -I- n. Otherwise the size of 
the largest element is at most n — 1. 

Let now /(e) = {2n)^^'^ — 2n. Thus \Sh\ = (2n)^^*^. To test the satisfiability of 
h we use the approximation algorithm to find a approximately largest element 
of Fh n Gh- Let c be the size of this element. If h is not satisfiable then certainly 
c < n. On the other hand, if h is satisfiable, then the largest element of Fh O Gh 
is of size |S'?i| — n. Therefore 

\Sh^ < \Sh\^S 

But then 

- \Sh\^-^ - 2\Sh\^-^ 2 
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where the second inequality follows from that > 2n. Hence c > n if h is 
satisfiable and c < n if it is not. We have a polynomial-time satisfiability test 
because /(e) is a polynomial in n and hence in \h\ when e is fixed, and therefore 
the matroid family {{Sh, Fh)he 3 CNF} and the greedoid family {{Sh, Gh)he 3 CisiF} 
can be represented by uniform oracles whose run times are polynomial in 
hence in \h\. □ 



Theorem 3. The maximum weight intersection problem for a matroid family 
and a greedoid family with domains {Sh '■ h G H}, given by uniform polynomial- 
time oracles, is not polynomial-time approximable within for any fixed 

k > 0, unless P = NP. 



Proof. The construction is similar to the proof of Theorem O 

Domain Sh consists of the truth assignment symbols and only one indicator 
element. The indicator has weight — n while all the truth assignment 

symbols have weight equal to 1. Then the formula h is satisfiable if and only 
if the heaviest element in Fh IT Gh has weight . Otherwise the heaviest 

element weights less than n. 

Let c be the weight obtained by an approximation algorithm that is assumed 
to approximate the optimal solution within 21'^'* I . If h is not satisfiable then 
c < n. Otherwise 






<2is.l'“ 



c 



which means that c> n. 

We again have a polynomial-time satisfiability test because = 2n -I- 1 
and therefore the weights have polynomial-size (binary) representations in \h\ 
and hence the weighted set Sh can be constructed in polynomial time in \h\. 
Moreover, the uniform oracles for the families of (Sh,Fh) and (Sh,Gh) can be 
made to run in polynomial time in \h\ as before. □ 
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